Overview
Brought to you by YData
Dataset statistics
| Number of variables | 112 |
|---|---|
| Number of observations | 4515661 |
| Missing cells | 350029119 |
| Missing cells (%) | 69.2% |
| Total size in memory | 3.8 GiB |
| Average record size in memory | 896.0 B |
Variable types
| Text | 112 |
|---|
Dataset
| Description | US NMNH Extant Specimen Records 0052487-241126133413365 |
|---|---|
| URL | https://doi.org/10.15468/dl.wttrju |
institutionID has constant value "urn:lsid:biocol.org:col:15463" | Constant |
collectionID has constant value "urn:uuid:60e28f81-e634-4869-aa3e-732caed713c8" | Constant |
institutionCode has constant value "US" | Constant |
collectionCode has constant value "US" | Constant |
datasetName has constant value "NMNH Extant Biology" | Constant |
eventType has constant value "-7.38" | Constant |
samplingProtocol has constant value "400.0" | Constant |
countryCode has constant value "1872" | Constant |
verbatimSRS has constant value "San Francisco" | Constant |
lithostratigraphicTerms has constant value "500.0" | Constant |
bed has constant value "Riccardia pinguis" | Constant |
identificationID has constant value "Variety" | Constant |
taxonID has constant value "Metzgeriales" | Constant |
acceptedNameUsageID has constant value "Aneuraceae" | Constant |
namePublishedInID has constant value "Riccardia" | Constant |
nameAccordingTo has constant value "variety" | Constant |
nomenclaturalCode has constant value "Skog, Laurence E." | Constant |
taxonRemarks has constant value "Plantae" | Constant |
catalogNumber has 604654 (13.4%) missing values | Missing |
recordedBy has 54378 (1.2%) missing values | Missing |
lifeStage has 4152582 (92.0%) missing values | Missing |
preparations has 4381212 (97.0%) missing values | Missing |
associatedMedia has 318147 (7.0%) missing values | Missing |
associatedSequences has 4515310 (> 99.9%) missing values | Missing |
occurrenceRemarks has 4424129 (98.0%) missing values | Missing |
organismName has 4515658 (> 99.9%) missing values | Missing |
eventType has 4515660 (> 99.9%) missing values | Missing |
fieldNumber has 4515399 (> 99.9%) missing values | Missing |
eventDate has 499507 (11.1%) missing values | Missing |
startDayOfYear has 707937 (15.7%) missing values | Missing |
endDayOfYear has 706297 (15.6%) missing values | Missing |
year has 499507 (11.1%) missing values | Missing |
month has 700051 (15.5%) missing values | Missing |
day has 1180026 (26.1%) missing values | Missing |
verbatimEventDate has 2995056 (66.3%) missing values | Missing |
habitat has 4009333 (88.8%) missing values | Missing |
samplingProtocol has 4515660 (> 99.9%) missing values | Missing |
sampleSizeValue has 4515659 (> 99.9%) missing values | Missing |
locationID has 4473993 (99.1%) missing values | Missing |
continent has 66158 (1.5%) missing values | Missing |
waterBody has 4496088 (99.6%) missing values | Missing |
islandGroup has 4403077 (97.5%) missing values | Missing |
island has 4139168 (91.7%) missing values | Missing |
countryCode has 4515660 (> 99.9%) missing values | Missing |
stateProvince has 1002183 (22.2%) missing values | Missing |
county has 3778676 (83.7%) missing values | Missing |
locality has 332028 (7.4%) missing values | Missing |
verbatimLocality has 4515656 (> 99.9%) missing values | Missing |
minimumElevationInMeters has 2860984 (63.4%) missing values | Missing |
maximumElevationInMeters has 4017410 (89.0%) missing values | Missing |
minimumDepthInMeters has 4475632 (99.1%) missing values | Missing |
maximumDepthInMeters has 4478965 (99.2%) missing values | Missing |
verbatimDepth has 4494022 (99.5%) missing values | Missing |
decimalLatitude has 3845453 (85.2%) missing values | Missing |
decimalLongitude has 3845454 (85.2%) missing values | Missing |
geodeticDatum has 4485859 (99.3%) missing values | Missing |
coordinateUncertaintyInMeters has 4509192 (99.9%) missing values | Missing |
coordinatePrecision has 4515658 (> 99.9%) missing values | Missing |
pointRadiusSpatialFit has 4515656 (> 99.9%) missing values | Missing |
verbatimCoordinates has 4515657 (> 99.9%) missing values | Missing |
verbatimLatitude has 4477670 (99.2%) missing values | Missing |
verbatimLongitude has 4477686 (99.2%) missing values | Missing |
verbatimCoordinateSystem has 4478628 (99.2%) missing values | Missing |
verbatimSRS has 4515660 (> 99.9%) missing values | Missing |
footprintSpatialFit has 4515658 (> 99.9%) missing values | Missing |
georeferenceProtocol has 4388537 (97.2%) missing values | Missing |
georeferenceRemarks has 4515150 (> 99.9%) missing values | Missing |
geologicalContextID has 4515657 (> 99.9%) missing values | Missing |
earliestEonOrLowestEonothem has 4515654 (> 99.9%) missing values | Missing |
latestEonOrHighestEonothem has 4515654 (> 99.9%) missing values | Missing |
latestEraOrHighestErathem has 4515658 (> 99.9%) missing values | Missing |
earliestPeriodOrLowestSystem has 4515654 (> 99.9%) missing values | Missing |
earliestEpochOrLowestSeries has 4515654 (> 99.9%) missing values | Missing |
latestEpochOrHighestSeries has 4515659 (> 99.9%) missing values | Missing |
latestAgeOrHighestStage has 4515657 (> 99.9%) missing values | Missing |
lowestBiostratigraphicZone has 4515658 (> 99.9%) missing values | Missing |
highestBiostratigraphicZone has 4515658 (> 99.9%) missing values | Missing |
lithostratigraphicTerms has 4515660 (> 99.9%) missing values | Missing |
formation has 4515658 (> 99.9%) missing values | Missing |
member has 4515654 (> 99.9%) missing values | Missing |
bed has 4515660 (> 99.9%) missing values | Missing |
identificationID has 4515660 (> 99.9%) missing values | Missing |
identificationQualifier has 4504655 (99.8%) missing values | Missing |
typeStatus has 4399315 (97.4%) missing values | Missing |
identifiedBy has 3958097 (87.7%) missing values | Missing |
identifiedByID has 4515655 (> 99.9%) missing values | Missing |
dateIdentified has 4515654 (> 99.9%) missing values | Missing |
identificationReferences has 4515655 (> 99.9%) missing values | Missing |
identificationVerificationStatus has 4515654 (> 99.9%) missing values | Missing |
identificationRemarks has 4515654 (> 99.9%) missing values | Missing |
taxonID has 4515660 (> 99.9%) missing values | Missing |
scientificNameID has 4515655 (> 99.9%) missing values | Missing |
acceptedNameUsageID has 4515660 (> 99.9%) missing values | Missing |
nameAccordingToID has 4515655 (> 99.9%) missing values | Missing |
namePublishedInID has 4515660 (> 99.9%) missing values | Missing |
acceptedNameUsage has 4515655 (> 99.9%) missing values | Missing |
parentNameUsage has 4515659 (> 99.9%) missing values | Missing |
nameAccordingTo has 4515660 (> 99.9%) missing values | Missing |
namePublishedInYear has 4515656 (> 99.9%) missing values | Missing |
phylum has 3795307 (84.0%) missing values | Missing |
class has 166450 (3.7%) missing values | Missing |
order has 53019 (1.2%) missing values | Missing |
family has 49040 (1.1%) missing values | Missing |
subgenus has 4515572 (> 99.9%) missing values | Missing |
infraspecificEpithet has 4196068 (92.9%) missing values | Missing |
cultivarEpithet has 4515659 (> 99.9%) missing values | Missing |
taxonRank has 4196350 (92.9%) missing values | Missing |
scientificNameAuthorship has 491289 (10.9%) missing values | Missing |
vernacularName has 4515658 (> 99.9%) missing values | Missing |
nomenclaturalCode has 4515660 (> 99.9%) missing values | Missing |
nomenclaturalStatus has 4515659 (> 99.9%) missing values | Missing |
taxonRemarks has 4515659 (> 99.9%) missing values | Missing |
gbifID has unique values | Unique |
occurrenceID has unique values | Unique |
Reproduction
| Analysis started | 2025-01-14 16:41:47.797700 |
|---|---|
| Analysis finished | 2025-01-14 16:45:01.983197 |
| Duration | 3 minutes and 14.19 seconds |
| Software version | ydata-profiling vv4.12.1 |
| Download configuration | config.json |
Variables
gbifID
Text
Unique 
| Distinct | 4515661 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 34.5 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Unique
| Unique | 4515661 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 1320179379 |
|---|---|
| 2nd row | 1675994101 |
| 3rd row | 2592240144 |
| 4th row | 2571494932 |
| 5th row | 3357270605 |
| Value | Count | Frequency (%) |
| 1320179379 | 1 | < 0.1% |
| 1320180447 | 1 | < 0.1% |
| 3897771070 | 1 | < 0.1% |
| 1320181031 | 1 | < 0.1% |
| 1321730416 | 1 | < 0.1% |
| 1321730340 | 1 | < 0.1% |
| 3467345455 | 1 | < 0.1% |
| 1456364699 | 1 | < 0.1% |
| 1321730091 | 1 | < 0.1% |
| 1320184062 | 1 | < 0.1% |
| Other values (4515651) | 4515651 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 6587745 | |
| 2 | 6301614 | |
| 3 | 5901629 | |
| 5 | 4291608 | |
| 6 | 3896698 | |
| 4 | 3891530 | |
| 7 | 3755094 | |
| 8 | 3664337 | |
| 0 | 3571035 | |
| 9 | 3295320 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 45156610 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 6587745 | |
| 2 | 6301614 | |
| 3 | 5901629 | |
| 5 | 4291608 | |
| 6 | 3896698 | |
| 4 | 3891530 | |
| 7 | 3755094 | |
| 8 | 3664337 | |
| 0 | 3571035 | |
| 9 | 3295320 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 45156610 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 6587745 | |
| 2 | 6301614 | |
| 3 | 5901629 | |
| 5 | 4291608 | |
| 6 | 3896698 | |
| 4 | 3891530 | |
| 7 | 3755094 | |
| 8 | 3664337 | |
| 0 | 3571035 | |
| 9 | 3295320 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 45156610 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 6587745 | |
| 2 | 6301614 | |
| 3 | 5901629 | |
| 5 | 4291608 | |
| 6 | 3896698 | |
| 4 | 3891530 | |
| 7 | 3755094 | |
| 8 | 3664337 | |
| 0 | 3571035 | |
| 9 | 3295320 |
modified
Text
| Distinct | 180669 |
|---|---|
| Distinct (%) | 4.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 34.5 MiB |
Length
| Max length | 19 |
|---|---|
| Median length | 19 |
| Mean length | 19 |
| Min length | 19 |
Unique
| Unique | 59498 ? |
|---|---|
| Unique (%) | 1.3% |
Sample
| 1st row | 2016-08-30 13:42:00 |
|---|---|
| 2nd row | 2022-10-26 17:57:00 |
| 3rd row | 2020-05-10 23:06:00 |
| 4th row | 2020-04-09 11:53:00 |
| 5th row | 2021-09-10 21:16:00 |
| Value | Count | Frequency (%) |
| 2017-08-04 | 233209 | 2.6% |
| 2022-10-26 | 209132 | 2.3% |
| 2022-06-03 | 121741 | 1.3% |
| 2022-09-08 | 97141 | 1.1% |
| 2017-12-19 | 94237 | 1.0% |
| 2022-06-02 | 84448 | 0.9% |
| 2024-10-17 | 76731 | 0.8% |
| 2016-08-29 | 71251 | 0.8% |
| 2016-08-30 | 70049 | 0.8% |
| 2019-07-12 | 61211 | 0.7% |
| Other values (3909) | 7912172 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 23062874 | |
| 2 | 11939180 | |
| 1 | 11301135 | |
| - | 9031322 | 10.5% |
| : | 9031322 | 10.5% |
| 4515661 | 5.3% | |
| 3 | 2960518 | 3.5% |
| 8 | 2607270 | 3.0% |
| 9 | 2572066 | 3.0% |
| 4 | 2388421 | 2.8% |
| Other values (3) | 6387790 | 7.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 63219254 | |
| Dash Punctuation | 9031322 | 10.5% |
| Other Punctuation | 9031322 | 10.5% |
| Space Separator | 4515661 | 5.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 23062874 | |
| 2 | 11939180 | |
| 1 | 11301135 | |
| 3 | 2960518 | 4.7% |
| 8 | 2607270 | 4.1% |
| 9 | 2572066 | 4.1% |
| 4 | 2388421 | 3.8% |
| 7 | 2315554 | 3.7% |
| 5 | 2112325 | 3.3% |
| 6 | 1959911 | 3.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 9031322 |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 9031322 |
Space Separator
| Value | Count | Frequency (%) |
| 4515661 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 85797559 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 23062874 | |
| 2 | 11939180 | |
| 1 | 11301135 | |
| - | 9031322 | 10.5% |
| : | 9031322 | 10.5% |
| 4515661 | 5.3% | |
| 3 | 2960518 | 3.5% |
| 8 | 2607270 | 3.0% |
| 9 | 2572066 | 3.0% |
| 4 | 2388421 | 2.8% |
| Other values (3) | 6387790 | 7.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 85797559 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 23062874 | |
| 2 | 11939180 | |
| 1 | 11301135 | |
| - | 9031322 | 10.5% |
| : | 9031322 | 10.5% |
| 4515661 | 5.3% | |
| 3 | 2960518 | 3.5% |
| 8 | 2607270 | 3.0% |
| 9 | 2572066 | 3.0% |
| 4 | 2388421 | 2.8% |
| Other values (3) | 6387790 | 7.4% |
institutionID
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 34.5 MiB |
Length
| Max length | 29 |
|---|---|
| Median length | 29 |
| Mean length | 29 |
| Min length | 29 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | urn:lsid:biocol.org:col:15463 |
|---|---|
| 2nd row | urn:lsid:biocol.org:col:15463 |
| 3rd row | urn:lsid:biocol.org:col:15463 |
| 4th row | urn:lsid:biocol.org:col:15463 |
| 5th row | urn:lsid:biocol.org:col:15463 |
| Value | Count | Frequency (%) |
| urn:lsid:biocol.org:col:15463 | 4515661 |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 18062644 | |
| : | 18062644 | |
| l | 13546983 | 10.3% |
| i | 9031322 | 6.9% |
| r | 9031322 | 6.9% |
| c | 9031322 | 6.9% |
| g | 4515661 | 3.4% |
| 6 | 4515661 | 3.4% |
| 4 | 4515661 | 3.4% |
| 5 | 4515661 | 3.4% |
| Other values (8) | 36125288 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 85797559 | |
| Other Punctuation | 22578305 | 17.2% |
| Decimal Number | 22578305 | 17.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 18062644 | |
| l | 13546983 | |
| i | 9031322 | |
| r | 9031322 | |
| c | 9031322 | |
| g | 4515661 | 5.3% |
| u | 4515661 | 5.3% |
| b | 4515661 | 5.3% |
| d | 4515661 | 5.3% |
| s | 4515661 | 5.3% |
Decimal Number
| Value | Count | Frequency (%) |
| 6 | 4515661 | |
| 4 | 4515661 | |
| 5 | 4515661 | |
| 1 | 4515661 | |
| 3 | 4515661 |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 18062644 | |
| . | 4515661 | 20.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 85797559 | |
| Common | 45156610 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 18062644 | |
| l | 13546983 | |
| i | 9031322 | |
| r | 9031322 | |
| c | 9031322 | |
| g | 4515661 | 5.3% |
| u | 4515661 | 5.3% |
| b | 4515661 | 5.3% |
| d | 4515661 | 5.3% |
| s | 4515661 | 5.3% |
Common
| Value | Count | Frequency (%) |
| : | 18062644 | |
| 6 | 4515661 | 10.0% |
| 4 | 4515661 | 10.0% |
| 5 | 4515661 | 10.0% |
| 1 | 4515661 | 10.0% |
| . | 4515661 | 10.0% |
| 3 | 4515661 | 10.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 130954169 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 18062644 | |
| : | 18062644 | |
| l | 13546983 | 10.3% |
| i | 9031322 | 6.9% |
| r | 9031322 | 6.9% |
| c | 9031322 | 6.9% |
| g | 4515661 | 3.4% |
| 6 | 4515661 | 3.4% |
| 4 | 4515661 | 3.4% |
| 5 | 4515661 | 3.4% |
| Other values (8) | 36125288 |
collectionID
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 34.5 MiB |
Length
| Max length | 45 |
|---|---|
| Median length | 45 |
| Mean length | 45 |
| Min length | 45 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | urn:uuid:60e28f81-e634-4869-aa3e-732caed713c8 |
|---|---|
| 2nd row | urn:uuid:60e28f81-e634-4869-aa3e-732caed713c8 |
| 3rd row | urn:uuid:60e28f81-e634-4869-aa3e-732caed713c8 |
| 4th row | urn:uuid:60e28f81-e634-4869-aa3e-732caed713c8 |
| 5th row | urn:uuid:60e28f81-e634-4869-aa3e-732caed713c8 |
| Value | Count | Frequency (%) |
| urn:uuid:60e28f81-e634-4869-aa3e-732caed713c8 | 4515661 |
Most occurring characters
| Value | Count | Frequency (%) |
| 8 | 18062644 | 8.9% |
| 3 | 18062644 | 8.9% |
| - | 18062644 | 8.9% |
| e | 18062644 | 8.9% |
| 6 | 13546983 | 6.7% |
| a | 13546983 | 6.7% |
| u | 13546983 | 6.7% |
| d | 9031322 | 4.4% |
| 2 | 9031322 | 4.4% |
| 1 | 9031322 | 4.4% |
| Other values (10) | 63219254 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 94828881 | |
| Lowercase Letter | 81281898 | |
| Dash Punctuation | 18062644 | 8.9% |
| Other Punctuation | 9031322 | 4.4% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 8 | 18062644 | |
| 3 | 18062644 | |
| 6 | 13546983 | |
| 2 | 9031322 | |
| 1 | 9031322 | |
| 4 | 9031322 | |
| 7 | 9031322 | |
| 0 | 4515661 | 4.8% |
| 9 | 4515661 | 4.8% |
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 18062644 | |
| a | 13546983 | |
| u | 13546983 | |
| d | 9031322 | |
| c | 9031322 | |
| r | 4515661 | 5.6% |
| f | 4515661 | 5.6% |
| i | 4515661 | 5.6% |
| n | 4515661 | 5.6% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 18062644 |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 9031322 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 121922847 | |
| Latin | 81281898 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 8 | 18062644 | |
| 3 | 18062644 | |
| - | 18062644 | |
| 6 | 13546983 | |
| 2 | 9031322 | |
| 1 | 9031322 | |
| : | 9031322 | |
| 4 | 9031322 | |
| 7 | 9031322 | |
| 0 | 4515661 | 3.7% |
Latin
| Value | Count | Frequency (%) |
| e | 18062644 | |
| a | 13546983 | |
| u | 13546983 | |
| d | 9031322 | |
| c | 9031322 | |
| r | 4515661 | 5.6% |
| f | 4515661 | 5.6% |
| i | 4515661 | 5.6% |
| n | 4515661 | 5.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 203204745 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 8 | 18062644 | 8.9% |
| 3 | 18062644 | 8.9% |
| - | 18062644 | 8.9% |
| e | 18062644 | 8.9% |
| 6 | 13546983 | 6.7% |
| a | 13546983 | 6.7% |
| u | 13546983 | 6.7% |
| d | 9031322 | 4.4% |
| 2 | 9031322 | 4.4% |
| 1 | 9031322 | 4.4% |
| Other values (10) | 63219254 |
institutionCode
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 34.5 MiB |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | US |
|---|---|
| 2nd row | US |
| 3rd row | US |
| 4th row | US |
| 5th row | US |
| Value | Count | Frequency (%) |
| us | 4515661 |
Most occurring characters
| Value | Count | Frequency (%) |
| U | 4515661 | |
| S | 4515661 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 9031322 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 4515661 | |
| S | 4515661 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 9031322 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| U | 4515661 | |
| S | 4515661 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9031322 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| U | 4515661 | |
| S | 4515661 |
collectionCode
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 34.5 MiB |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | US |
|---|---|
| 2nd row | US |
| 3rd row | US |
| 4th row | US |
| 5th row | US |
| Value | Count | Frequency (%) |
| us | 4515661 |
Most occurring characters
| Value | Count | Frequency (%) |
| U | 4515661 | |
| S | 4515661 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 9031322 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 4515661 | |
| S | 4515661 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 9031322 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| U | 4515661 | |
| S | 4515661 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9031322 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| U | 4515661 | |
| S | 4515661 |
datasetName
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 34.5 MiB |
Length
| Max length | 19 |
|---|---|
| Median length | 19 |
| Mean length | 19 |
| Min length | 19 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | NMNH Extant Biology |
|---|---|
| 2nd row | NMNH Extant Biology |
| 3rd row | NMNH Extant Biology |
| 4th row | NMNH Extant Biology |
| 5th row | NMNH Extant Biology |
| Value | Count | Frequency (%) |
| nmnh | 4515661 | |
| extant | 4515661 | |
| biology | 4515661 |
Most occurring characters
| Value | Count | Frequency (%) |
| N | 9031322 | 10.5% |
| 9031322 | 10.5% | |
| t | 9031322 | 10.5% |
| o | 9031322 | 10.5% |
| M | 4515661 | 5.3% |
| H | 4515661 | 5.3% |
| E | 4515661 | 5.3% |
| x | 4515661 | 5.3% |
| a | 4515661 | 5.3% |
| n | 4515661 | 5.3% |
| Other values (5) | 22578305 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 49672271 | |
| Uppercase Letter | 27093966 | |
| Space Separator | 9031322 | 10.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 9031322 | |
| o | 9031322 | |
| x | 4515661 | |
| a | 4515661 | |
| n | 4515661 | |
| i | 4515661 | |
| l | 4515661 | |
| g | 4515661 | |
| y | 4515661 |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 9031322 | |
| M | 4515661 | |
| H | 4515661 | |
| E | 4515661 | |
| B | 4515661 |
Space Separator
| Value | Count | Frequency (%) |
| 9031322 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 76766237 | |
| Common | 9031322 | 10.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| N | 9031322 | |
| t | 9031322 | |
| o | 9031322 | |
| M | 4515661 | 5.9% |
| H | 4515661 | 5.9% |
| E | 4515661 | 5.9% |
| x | 4515661 | 5.9% |
| a | 4515661 | 5.9% |
| n | 4515661 | 5.9% |
| B | 4515661 | 5.9% |
| Other values (4) | 18062644 |
Common
| Value | Count | Frequency (%) |
| 9031322 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 85797559 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| N | 9031322 | 10.5% |
| 9031322 | 10.5% | |
| t | 9031322 | 10.5% |
| o | 9031322 | 10.5% |
| M | 4515661 | 5.3% |
| H | 4515661 | 5.3% |
| E | 4515661 | 5.3% |
| x | 4515661 | 5.3% |
| a | 4515661 | 5.3% |
| n | 4515661 | 5.3% |
| Other values (5) | 22578305 |
basisOfRecord
Text
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 34.5 MiB |
Length
| Max length | 18 |
|---|---|
| Median length | 17 |
| Mean length | 17.01103094 |
| Min length | 16 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | PreservedSpecimen |
|---|---|
| 2nd row | PreservedSpecimen |
| 3rd row | PreservedSpecimen |
| 4th row | PreservedSpecimen |
| 5th row | PreservedSpecimen |
| Value | Count | Frequency (%) |
| preservedspecimen | 4465843 | |
| machineobservation | 49815 | 1.1% |
| humanobservation | 3 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 22428848 | |
| r | 8981504 | |
| n | 4565479 | 5.9% |
| i | 4565476 | 5.9% |
| s | 4515661 | 5.9% |
| v | 4515661 | 5.9% |
| c | 4515658 | 5.9% |
| m | 4465846 | 5.8% |
| P | 4465843 | 5.8% |
| p | 4465843 | 5.8% |
| Other values (11) | 9330230 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 67784727 | |
| Uppercase Letter | 9031322 | 11.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 22428848 | |
| r | 8981504 | |
| n | 4565479 | 6.7% |
| i | 4565476 | 6.7% |
| s | 4515661 | 6.7% |
| v | 4515661 | 6.7% |
| c | 4515658 | 6.7% |
| m | 4465846 | 6.6% |
| p | 4465843 | 6.6% |
| d | 4465843 | 6.6% |
| Other values (6) | 298908 | 0.4% |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 4465843 | |
| S | 4465843 | |
| O | 49818 | 0.6% |
| M | 49815 | 0.6% |
| H | 3 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 76816049 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 22428848 | |
| r | 8981504 | |
| n | 4565479 | 5.9% |
| i | 4565476 | 5.9% |
| s | 4515661 | 5.9% |
| v | 4515661 | 5.9% |
| c | 4515658 | 5.9% |
| m | 4465846 | 5.8% |
| P | 4465843 | 5.8% |
| p | 4465843 | 5.8% |
| Other values (11) | 9330230 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 76816049 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 22428848 | |
| r | 8981504 | |
| n | 4565479 | 5.9% |
| i | 4565476 | 5.9% |
| s | 4515661 | 5.9% |
| v | 4515661 | 5.9% |
| c | 4515658 | 5.9% |
| m | 4465846 | 5.8% |
| P | 4465843 | 5.8% |
| p | 4465843 | 5.8% |
| Other values (11) | 9330230 |
occurrenceID
Text
Unique 
| Distinct | 4515661 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 34.5 MiB |
Length
| Max length | 63 |
|---|---|
| Median length | 63 |
| Mean length | 63 |
| Min length | 63 |
Unique
| Unique | 4515661 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | http://n2t.net/ark:/65665/383aab1ce-8b35-4007-8eba-472b592b7a99 |
|---|---|
| 2nd row | http://n2t.net/ark:/65665/3c8351e79-8b3b-4df0-80be-cb019ba60185 |
| 3rd row | http://n2t.net/ark:/65665/3c8377593-a51b-4b6a-835d-649053b2ef0f |
| 4th row | http://n2t.net/ark:/65665/383b388e9-b7cc-4b41-95cc-e0a1b092179a |
| 5th row | http://n2t.net/ark:/65665/3c83e5abc-b64e-45a4-aa42-faf5abc93792 |
| Value | Count | Frequency (%) |
| http://n2t.net/ark:/65665/383aab1ce-8b35-4007-8eba-472b592b7a99 | 1 | < 0.1% |
| http://n2t.net/ark:/65665/383b6d73e-eb70-4b52-81b8-336878ca92f0 | 1 | < 0.1% |
| http://n2t.net/ark:/65665/3c84a8b17-83ab-45b2-bb8e-ccea78a7e003 | 1 | < 0.1% |
| http://n2t.net/ark:/65665/383be1f82-08fe-4004-9374-3793b1df97c5 | 1 | < 0.1% |
| http://n2t.net/ark:/65665/3c842b3ee-b36e-41da-867e-a7c09def7524 | 1 | < 0.1% |
| http://n2t.net/ark:/65665/3c841ed6b-df48-4633-aae7-d3e846a86aa3 | 1 | < 0.1% |
| http://n2t.net/ark:/65665/383b77fe7-0ea2-407e-bde8-bba5ef603c4a | 1 | < 0.1% |
| http://n2t.net/ark:/65665/3c8411b46-27ee-4d70-ab07-e1bd72f2e83a | 1 | < 0.1% |
| http://n2t.net/ark:/65665/3c83f60ef-2f0d-451e-986a-e0c2dfb03675 | 1 | < 0.1% |
| http://n2t.net/ark:/65665/383e13640-df35-46e1-befc-1068e49e2444 | 1 | < 0.1% |
| Other values (4515651) | 4515651 |
Most occurring characters
| Value | Count | Frequency (%) |
| / | 22578305 | 7.9% |
| 6 | 22016321 | 7.7% |
| - | 18062644 | 6.3% |
| t | 18062644 | 6.3% |
| 5 | 17496925 | 6.2% |
| a | 14109308 | 5.0% |
| e | 12988442 | 4.6% |
| 4 | 12983714 | 4.6% |
| 2 | 12981604 | 4.6% |
| 3 | 12976062 | 4.6% |
| Other values (16) | 120230674 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 123048299 | |
| Lowercase Letter | 107250412 | |
| Other Punctuation | 36125288 | 12.7% |
| Dash Punctuation | 18062644 | 6.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 18062644 | |
| a | 14109308 | |
| e | 12988442 | |
| b | 9594278 | |
| n | 9031322 | |
| d | 8469759 | |
| c | 8468487 | |
| f | 8463528 | |
| k | 4515661 | 4.2% |
| r | 4515661 | 4.2% |
| Other values (2) | 9031322 |
Decimal Number
| Value | Count | Frequency (%) |
| 6 | 22016321 | |
| 5 | 17496925 | |
| 4 | 12983714 | |
| 2 | 12981604 | |
| 3 | 12976062 | |
| 9 | 9599378 | |
| 8 | 9594754 | |
| 1 | 8470306 | 6.9% |
| 0 | 8465201 | 6.9% |
| 7 | 8464034 | 6.9% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 22578305 | |
| : | 9031322 | 25.0% |
| . | 4515661 | 12.5% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 18062644 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 177236231 | |
| Latin | 107250412 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| / | 22578305 | |
| 6 | 22016321 | |
| - | 18062644 | |
| 5 | 17496925 | |
| 4 | 12983714 | |
| 2 | 12981604 | |
| 3 | 12976062 | |
| 9 | 9599378 | 5.4% |
| 8 | 9594754 | 5.4% |
| : | 9031322 | 5.1% |
| Other values (4) | 29915202 |
Latin
| Value | Count | Frequency (%) |
| t | 18062644 | |
| a | 14109308 | |
| e | 12988442 | |
| b | 9594278 | |
| n | 9031322 | |
| d | 8469759 | |
| c | 8468487 | |
| f | 8463528 | |
| k | 4515661 | 4.2% |
| r | 4515661 | 4.2% |
| Other values (2) | 9031322 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 284486643 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| / | 22578305 | 7.9% |
| 6 | 22016321 | 7.7% |
| - | 18062644 | 6.3% |
| t | 18062644 | 6.3% |
| 5 | 17496925 | 6.2% |
| a | 14109308 | 5.0% |
| e | 12988442 | 4.6% |
| 4 | 12983714 | 4.6% |
| 2 | 12981604 | 4.6% |
| 3 | 12976062 | 4.6% |
| Other values (16) | 120230674 |
catalogNumber
Text
Missing 
| Distinct | 3682462 |
|---|---|
| Distinct (%) | 94.2% |
| Missing | 604654 |
| Missing (%) | 13.4% |
| Memory size | 34.5 MiB |
Length
| Max length | 22 |
|---|---|
| Median length | 10 |
| Mean length | 9.635907069 |
| Min length | 4 |
Unique
| Unique | 3481841 ? |
|---|---|
| Unique (%) | 89.0% |
Sample
| 1st row | US 213621 |
|---|---|
| 2nd row | US 2144946 |
| 3rd row | US 3113222 |
| 4th row | US 2583825 |
| 5th row | US 3026466 |
| Value | Count | Frequency (%) |
| us | 3868231 | |
| sem | 238 | < 0.1% |
| 146 | < 0.1% | |
| stub | 135 | < 0.1% |
| 1 | 133 | < 0.1% |
| micrograph | 103 | < 0.1% |
| 169920 | 59 | < 0.1% |
| 2 | 44 | < 0.1% |
| 3 | 40 | < 0.1% |
| 95340 | 36 | < 0.1% |
| Other values (3682069) | 3910894 |
Most occurring characters
| Value | Count | Frequency (%) |
| S | 3911382 | |
| U | 3911009 | |
| 3869052 | ||
| 2 | 3434134 | |
| 1 | 3365755 | |
| 3 | 3065165 | |
| 5 | 2346706 | 6.2% |
| 6 | 2337766 | 6.2% |
| 4 | 2336936 | 6.2% |
| 7 | 2291395 | 6.1% |
| Other values (38) | 6816800 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 25862216 | |
| Uppercase Letter | 7887319 | 20.9% |
| Space Separator | 3869052 | 10.3% |
| Lowercase Letter | 44582 | 0.1% |
| Dash Punctuation | 22833 | 0.1% |
| Close Punctuation | 41 | < 0.1% |
| Open Punctuation | 41 | < 0.1% |
| Other Punctuation | 16 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| w | 42787 | |
| r | 208 | 0.5% |
| u | 176 | 0.4% |
| a | 170 | 0.4% |
| t | 153 | 0.3% |
| b | 150 | 0.3% |
| o | 114 | 0.3% |
| p | 109 | 0.2% |
| i | 107 | 0.2% |
| m | 105 | 0.2% |
| Other values (10) | 503 | 1.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 3911382 | |
| U | 3911009 | |
| D | 39576 | 0.5% |
| A | 24456 | 0.3% |
| E | 382 | < 0.1% |
| M | 238 | < 0.1% |
| P | 144 | < 0.1% |
| B | 72 | < 0.1% |
| L | 48 | < 0.1% |
| V | 9 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 3434134 | |
| 1 | 3365755 | |
| 3 | 3065165 | |
| 5 | 2346706 | |
| 6 | 2337766 | |
| 4 | 2336936 | |
| 7 | 2291395 | |
| 0 | 2245737 | |
| 8 | 2221728 | |
| 9 | 2216894 |
Other Punctuation
| Value | Count | Frequency (%) |
| ? | 9 | |
| . | 6 | |
| ' | 1 | 6.2% |
Space Separator
| Value | Count | Frequency (%) |
| 3869052 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 22833 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 41 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 41 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 29754199 | |
| Latin | 7931901 | 21.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| S | 3911382 | |
| U | 3911009 | |
| w | 42787 | 0.5% |
| D | 39576 | 0.5% |
| A | 24456 | 0.3% |
| E | 382 | < 0.1% |
| M | 238 | < 0.1% |
| r | 208 | < 0.1% |
| u | 176 | < 0.1% |
| a | 170 | < 0.1% |
| Other values (21) | 1517 | < 0.1% |
Common
| Value | Count | Frequency (%) |
| 3869052 | ||
| 2 | 3434134 | |
| 1 | 3365755 | |
| 3 | 3065165 | |
| 5 | 2346706 | |
| 6 | 2337766 | |
| 4 | 2336936 | |
| 7 | 2291395 | |
| 0 | 2245737 | |
| 8 | 2221728 | |
| Other values (7) | 2239825 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 37686100 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| S | 3911382 | |
| U | 3911009 | |
| 3869052 | ||
| 2 | 3434134 | |
| 1 | 3365755 | |
| 3 | 3065165 | |
| 5 | 2346706 | 6.2% |
| 6 | 2337766 | 6.2% |
| 4 | 2336936 | 6.2% |
| 7 | 2291395 | 6.1% |
| Other values (38) | 6816800 |
recordNumber
Text
| Distinct | 483515 |
|---|---|
| Distinct (%) | 10.8% |
| Missing | 39547 |
| Missing (%) | 0.9% |
| Memory size | 34.5 MiB |
Length
| Max length | 100 |
|---|---|
| Median length | 93 |
| Mean length | 4.492128887 |
| Min length | 1 |
Unique
| Unique | 349259 ? |
|---|---|
| Unique (%) | 7.8% |
Sample
| 1st row | BLM-210-IV-11-B-TDS |
|---|---|
| 2nd row | 4319 |
| 3rd row | 2429 |
| 4th row | 95426 |
| 5th row | 1414/512 |
| Value | Count | Frequency (%) |
| s.n | 643632 | 13.5% |
| bureau | 20598 | 0.4% |
| eyd | 15904 | 0.3% |
| s | 14313 | 0.3% |
| of | 13865 | 0.3% |
| n | 13794 | 0.3% |
| science | 13409 | 0.3% |
| d&ml | 12897 | 0.3% |
| 12506 | 0.3% | |
| h | 8672 | 0.2% |
| Other values (337587) | 3987307 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 2407969 | |
| 2 | 1865104 | |
| 3 | 1606272 | 8.0% |
| 4 | 1507264 | 7.5% |
| 0 | 1450162 | 7.2% |
| 5 | 1449381 | 7.2% |
| 6 | 1399430 | 7.0% |
| . | 1359512 | 6.8% |
| 7 | 1317204 | 6.6% |
| 8 | 1264749 | 6.3% |
| Other values (116) | 4480234 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 15496452 | |
| Lowercase Letter | 1833844 | 9.1% |
| Other Punctuation | 1441238 | 7.2% |
| Uppercase Letter | 729381 | 3.6% |
| Dash Punctuation | 306820 | 1.5% |
| Space Separator | 280783 | 1.4% |
| Open Punctuation | 7886 | < 0.1% |
| Close Punctuation | 7869 | < 0.1% |
| Other Number | 1652 | < 0.1% |
| Connector Punctuation | 693 | < 0.1% |
| Other values (8) | 663 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 687647 | |
| s | 674180 | |
| a | 73396 | 4.0% |
| e | 70280 | 3.8% |
| u | 48891 | 2.7% |
| r | 47050 | 2.6% |
| c | 40273 | 2.2% |
| o | 38076 | 2.1% |
| i | 35764 | 2.0% |
| t | 28937 | 1.6% |
| Other values (29) | 89350 | 4.9% |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 70745 | 9.7% |
| B | 70158 | 9.6% |
| S | 66805 | 9.2% |
| D | 55914 | 7.7% |
| H | 47763 | 6.5% |
| L | 41753 | 5.7% |
| M | 40705 | 5.6% |
| E | 40361 | 5.5% |
| I | 34875 | 4.8% |
| N | 30792 | 4.2% |
| Other values (21) | 229510 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1359512 | |
| & | 24645 | 1.7% |
| / | 23862 | 1.7% |
| * | 14056 | 1.0% |
| ? | 10512 | 0.7% |
| , | 4971 | 0.3% |
| ! | 2281 | 0.2% |
| # | 372 | < 0.1% |
| : | 371 | < 0.1% |
| ; | 329 | < 0.1% |
| Other values (6) | 327 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 2407969 | |
| 2 | 1865104 | |
| 3 | 1606272 | |
| 4 | 1507264 | |
| 0 | 1450162 | |
| 5 | 1449381 | |
| 6 | 1399430 | |
| 7 | 1317204 | |
| 8 | 1264749 | |
| 9 | 1228917 |
Other Number
| Value | Count | Frequency (%) |
| ½ | 1595 | |
| ² | 21 | 1.3% |
| ¼ | 17 | 1.0% |
| ¾ | 6 | 0.4% |
| ⅓ | 5 | 0.3% |
| ³ | 5 | 0.3% |
| ⁴ | 1 | 0.1% |
| ⅔ | 1 | 0.1% |
| ¹ | 1 | 0.1% |
Math Symbol
| Value | Count | Frequency (%) |
| = | 510 | |
| + | 133 | 20.5% |
| ~ | 4 | 0.6% |
| ± | 1 | 0.2% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 7214 | |
| [ | 424 | 5.4% |
| { | 248 | 3.1% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 7198 | |
| ] | 423 | 5.4% |
| } | 248 | 3.2% |
Other Letter
| Value | Count | Frequency (%) |
| ª | 2 | |
| º | 1 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 306820 |
Space Separator
| Value | Count | Frequency (%) |
| 280783 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 693 |
Final Punctuation
| Value | Count | Frequency (%) |
| › | 6 |
Modifier Letter
| Value | Count | Frequency (%) |
| ˍ | 2 |
Control
| Value | Count | Frequency (%) |
| | 1 |
Currency Symbol
| Value | Count | Frequency (%) |
| $ | 1 |
Modifier Symbol
| Value | Count | Frequency (%) |
| ´ | 1 |
Other Symbol
| Value | Count | Frequency (%) |
| ° | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 17544053 | |
| Latin | 2563226 | 12.7% |
| Greek | 2 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| n | 687647 | |
| s | 674180 | |
| a | 73396 | 2.9% |
| A | 70745 | 2.8% |
| e | 70280 | 2.7% |
| B | 70158 | 2.7% |
| S | 66805 | 2.6% |
| D | 55914 | 2.2% |
| u | 48891 | 1.9% |
| H | 47763 | 1.9% |
| Other values (61) | 697447 |
Common
| Value | Count | Frequency (%) |
| 1 | 2407969 | |
| 2 | 1865104 | |
| 3 | 1606272 | |
| 4 | 1507264 | |
| 0 | 1450162 | |
| 5 | 1449381 | |
| 6 | 1399430 | |
| . | 1359512 | |
| 7 | 1317204 | |
| 8 | 1264749 | |
| Other values (44) | 1917006 |
Greek
| Value | Count | Frequency (%) |
| Σ | 2 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 20105496 | |
| None | 1767 | < 0.1% |
| Punctuation | 10 | < 0.1% |
| Number Forms | 6 | < 0.1% |
| Modifier Letters | 2 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 2407969 | |
| 2 | 1865104 | |
| 3 | 1606272 | 8.0% |
| 4 | 1507264 | 7.5% |
| 0 | 1450162 | 7.2% |
| 5 | 1449381 | 7.2% |
| 6 | 1399430 | 7.0% |
| . | 1359512 | 6.8% |
| 7 | 1317204 | 6.6% |
| 8 | 1264749 | 6.3% |
| Other values (80) | 4478449 |
None
| Value | Count | Frequency (%) |
| ½ | 1595 | |
| è | 47 | 2.7% |
| ² | 21 | 1.2% |
| ¼ | 17 | 1.0% |
| ß | 12 | 0.7% |
| é | 11 | 0.6% |
| á | 9 | 0.5% |
| ó | 7 | 0.4% |
| ¾ | 6 | 0.3% |
| ü | 5 | 0.3% |
| Other values (21) | 37 | 2.1% |
Punctuation
| Value | Count | Frequency (%) |
| › | 6 | |
| … | 4 |
Number Forms
| Value | Count | Frequency (%) |
| ⅓ | 5 | |
| ⅔ | 1 | 16.7% |
Modifier Letters
| Value | Count | Frequency (%) |
| ˍ | 2 |
recordedBy
Text
Missing 
| Distinct | 148160 |
|---|---|
| Distinct (%) | 3.3% |
| Missing | 54378 |
| Missing (%) | 1.2% |
| Memory size | 34.5 MiB |
Length
| Max length | 207 |
|---|---|
| Median length | 182 |
| Mean length | 17.24845678 |
| Min length | 1 |
Unique
| Unique | 68031 ? |
|---|---|
| Unique (%) | 1.5% |
Sample
| 1st row | Continental Shelf Associates for the MMS/BLM |
|---|---|
| 2nd row | J. Soukup |
| 3rd row | I. Morel |
| 4th row | J. Steyermark & Cora Steyermark |
| 5th row | A. Oakes & -. Ellis |
| Value | Count | Frequency (%) |
| 1250739 | 7.3% | |
| j | 893043 | 5.2% |
| a | 765542 | 4.5% |
| r | 679021 | 4.0% |
| e | 678845 | 4.0% |
| c | 633969 | 3.7% |
| m | 612375 | 3.6% |
| h | 550473 | 3.2% |
| l | 447598 | 2.6% |
| w | 441682 | 2.6% |
| Other values (47910) | 10064443 |
Most occurring characters
| Value | Count | Frequency (%) |
| 12556447 | ||
| . | 9149950 | 11.9% |
| e | 4946918 | 6.4% |
| r | 3620522 | 4.7% |
| a | 3597020 | 4.7% |
| o | 3035449 | 3.9% |
| n | 3020373 | 3.9% |
| l | 2882126 | 3.7% |
| i | 2489174 | 3.2% |
| t | 2009722 | 2.6% |
| Other values (159) | 29642546 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 37329688 | |
| Uppercase Letter | 15658675 | |
| Space Separator | 12556447 | 16.3% |
| Other Punctuation | 11032380 | 14.3% |
| Dash Punctuation | 338372 | 0.4% |
| Decimal Number | 13770 | < 0.1% |
| Open Punctuation | 10381 | < 0.1% |
| Close Punctuation | 10377 | < 0.1% |
| Math Symbol | 95 | < 0.1% |
| Modifier Symbol | 37 | < 0.1% |
| Other values (5) | 25 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 4946918 | |
| r | 3620522 | |
| a | 3597020 | |
| o | 3035449 | 8.1% |
| n | 3020373 | 8.1% |
| l | 2882126 | 7.7% |
| i | 2489174 | 6.7% |
| t | 2009722 | 5.4% |
| s | 1900211 | 5.1% |
| u | 1165862 | 3.1% |
| Other values (72) | 8662311 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 1201916 | 7.7% |
| S | 1183226 | 7.6% |
| M | 1107889 | 7.1% |
| R | 1107469 | 7.1% |
| H | 1091576 | 7.0% |
| A | 1069398 | 6.8% |
| J | 1064721 | 6.8% |
| E | 869534 | 5.6% |
| B | 829485 | 5.3% |
| L | 808858 | 5.2% |
| Other values (37) | 5324603 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 9149950 | |
| & | 1104307 | 10.0% |
| , | 759260 | 6.9% |
| ' | 13249 | 0.1% |
| / | 3911 | < 0.1% |
| " | 1654 | < 0.1% |
| ; | 19 | < 0.1% |
| ? | 17 | < 0.1% |
| ¡ | 5 | < 0.1% |
| : | 3 | < 0.1% |
| Other values (4) | 5 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 8 | 2642 | |
| 1 | 2372 | |
| 9 | 2198 | |
| 0 | 2008 | |
| 3 | 1492 | |
| 4 | 1264 | |
| 5 | 1196 | |
| 2 | 395 | 2.9% |
| 7 | 191 | 1.4% |
| 6 | 12 | 0.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 10272 | |
| [ | 108 | 1.0% |
| ‚ | 1 | < 0.1% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 10270 | |
| ] | 107 | 1.0% |
Math Symbol
| Value | Count | Frequency (%) |
| = | 93 | |
| + | 2 | 2.1% |
Control
| Value | Count | Frequency (%) |
| | 1 | |
| | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 12556447 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 338372 |
Modifier Symbol
| Value | Count | Frequency (%) |
| ´ | 37 |
Other Symbol
| Value | Count | Frequency (%) |
| ° | 14 |
Final Punctuation
| Value | Count | Frequency (%) |
| » | 4 |
Initial Punctuation
| Value | Count | Frequency (%) |
| « | 4 |
Currency Symbol
| Value | Count | Frequency (%) |
| ¢ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 52988360 | |
| Common | 23961884 | |
| Cyrillic | 2 | < 0.1% |
| Greek | 1 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 4946918 | 9.3% |
| r | 3620522 | 6.8% |
| a | 3597020 | 6.8% |
| o | 3035449 | 5.7% |
| n | 3020373 | 5.7% |
| l | 2882126 | 5.4% |
| i | 2489174 | 4.7% |
| t | 2009722 | 3.8% |
| s | 1900211 | 3.6% |
| C | 1201916 | 2.3% |
| Other values (117) | 24284929 |
Common
| Value | Count | Frequency (%) |
| 12556447 | ||
| . | 9149950 | |
| & | 1104307 | 4.6% |
| , | 759260 | 3.2% |
| - | 338372 | 1.4% |
| ' | 13249 | 0.1% |
| ( | 10272 | < 0.1% |
| ) | 10270 | < 0.1% |
| / | 3911 | < 0.1% |
| 8 | 2642 | < 0.1% |
| Other values (30) | 13204 | 0.1% |
Cyrillic
| Value | Count | Frequency (%) |
| Ӧ | 2 |
Greek
| Value | Count | Frequency (%) |
| β | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 76700430 | |
| None | 249806 | 0.3% |
| IPA Ext | 8 | < 0.1% |
| Cyrillic | 2 | < 0.1% |
| Punctuation | 1 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 12556447 | ||
| . | 9149950 | 11.9% |
| e | 4946918 | 6.4% |
| r | 3620522 | 4.7% |
| a | 3597020 | 4.7% |
| o | 3035449 | 4.0% |
| n | 3020373 | 3.9% |
| l | 2882126 | 3.8% |
| i | 2489174 | 3.2% |
| t | 2009722 | 2.6% |
| Other values (72) | 29392729 |
None
| Value | Count | Frequency (%) |
| á | 42487 | |
| é | 42189 | |
| ó | 38921 | |
| í | 28776 | |
| ñ | 24979 | |
| è | 17057 | |
| ü | 13598 | 5.4% |
| ö | 10723 | 4.3% |
| ê | 6878 | 2.8% |
| ç | 3147 | 1.3% |
| Other values (74) | 21051 |
IPA Ext
| Value | Count | Frequency (%) |
| ɶ | 8 |
Cyrillic
| Value | Count | Frequency (%) |
| Ӧ | 2 |
Punctuation
| Value | Count | Frequency (%) |
| ‚ | 1 |
individualCount
Text
| Distinct | 22 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 560 |
| Missing (%) | < 0.1% |
| Memory size | 34.5 MiB |
Length
| Max length | 2 |
|---|---|
| Median length | 1 |
| Mean length | 1.000007973 |
| Min length | 1 |
Unique
| Unique | 5 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
| Value | Count | Frequency (%) |
| 1 | 4513894 | |
| 2 | 489 | < 0.1% |
| 0 | 306 | < 0.1% |
| 3 | 137 | < 0.1% |
| 4 | 94 | < 0.1% |
| 5 | 55 | < 0.1% |
| 6 | 40 | < 0.1% |
| 7 | 21 | < 0.1% |
| 8 | 16 | < 0.1% |
| 9 | 13 | < 0.1% |
| Other values (12) | 36 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 4513935 | |
| 2 | 496 | < 0.1% |
| 0 | 315 | < 0.1% |
| 3 | 142 | < 0.1% |
| 4 | 96 | < 0.1% |
| 5 | 57 | < 0.1% |
| 6 | 42 | < 0.1% |
| 7 | 22 | < 0.1% |
| 8 | 17 | < 0.1% |
| 9 | 15 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 4515137 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 4513935 | |
| 2 | 496 | < 0.1% |
| 0 | 315 | < 0.1% |
| 3 | 142 | < 0.1% |
| 4 | 96 | < 0.1% |
| 5 | 57 | < 0.1% |
| 6 | 42 | < 0.1% |
| 7 | 22 | < 0.1% |
| 8 | 17 | < 0.1% |
| 9 | 15 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 4515137 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 4513935 | |
| 2 | 496 | < 0.1% |
| 0 | 315 | < 0.1% |
| 3 | 142 | < 0.1% |
| 4 | 96 | < 0.1% |
| 5 | 57 | < 0.1% |
| 6 | 42 | < 0.1% |
| 7 | 22 | < 0.1% |
| 8 | 17 | < 0.1% |
| 9 | 15 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4515137 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 4513935 | |
| 2 | 496 | < 0.1% |
| 0 | 315 | < 0.1% |
| 3 | 142 | < 0.1% |
| 4 | 96 | < 0.1% |
| 5 | 57 | < 0.1% |
| 6 | 42 | < 0.1% |
| 7 | 22 | < 0.1% |
| 8 | 17 | < 0.1% |
| 9 | 15 | < 0.1% |
lifeStage
Text
Missing 
| Distinct | 117 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 4152582 |
| Missing (%) | 92.0% |
| Memory size | 34.5 MiB |
Length
| Max length | 60 |
|---|---|
| Median length | 9 |
| Mean length | 10.22810187 |
| Min length | 1 |
Unique
| Unique | 33 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Fruiting |
|---|---|
| 2nd row | In bud |
| 3rd row | Flowering |
| 4th row | Flowering |
| 5th row | Immature fruit |
| Value | Count | Frequency (%) |
| flowering | 233588 | |
| fruiting | 101100 | |
| and | 42692 | 9.2% |
| vegetative | 23865 | 5.1% |
| fertile | 18492 | 4.0% |
| in | 8662 | 1.9% |
| bud | 8099 | 1.7% |
| flower | 7510 | 1.6% |
| fruit | 7391 | 1.6% |
| sterile | 3316 | 0.7% |
| Other values (52) | 9209 | 2.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 489124 | |
| n | 386365 | |
| r | 377645 | |
| e | 366016 | |
| g | 358832 | |
| F | 358692 | |
| l | 267441 | |
| o | 243782 | |
| w | 243567 | |
| t | 181837 | 4.9% |
| Other values (34) | 440308 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3213326 | |
| Uppercase Letter | 398816 | 10.7% |
| Space Separator | 100845 | 2.7% |
| Other Punctuation | 608 | < 0.1% |
| Close Punctuation | 7 | < 0.1% |
| Open Punctuation | 7 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 489124 | |
| n | 386365 | |
| r | 377645 | |
| e | 366016 | |
| g | 358832 | |
| l | 267441 | |
| o | 243782 | |
| w | 243567 | |
| t | 181837 | 5.7% |
| u | 120719 | 3.8% |
| Other values (13) | 177998 | 5.5% |
Uppercase Letter
| Value | Count | Frequency (%) |
| F | 358692 | |
| V | 23712 | 5.9% |
| I | 10944 | 2.7% |
| S | 3091 | 0.8% |
| M | 1845 | 0.5% |
| B | 367 | 0.1% |
| Y | 156 | < 0.1% |
| C | 3 | < 0.1% |
| P | 3 | < 0.1% |
| J | 2 | < 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| ? | 330 | |
| & | 178 | |
| ; | 52 | 8.6% |
| . | 30 | 4.9% |
| , | 18 | 3.0% |
Close Punctuation
| Value | Count | Frequency (%) |
| ] | 6 | |
| ) | 1 | 14.3% |
Open Punctuation
| Value | Count | Frequency (%) |
| [ | 6 | |
| ( | 1 | 14.3% |
Space Separator
| Value | Count | Frequency (%) |
| 100845 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3612142 | |
| Common | 101467 | 2.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 489124 | |
| n | 386365 | |
| r | 377645 | |
| e | 366016 | |
| g | 358832 | |
| F | 358692 | |
| l | 267441 | |
| o | 243782 | |
| w | 243567 | |
| t | 181837 | 5.0% |
| Other values (24) | 338841 |
Common
| Value | Count | Frequency (%) |
| 100845 | ||
| ? | 330 | 0.3% |
| & | 178 | 0.2% |
| ; | 52 | 0.1% |
| . | 30 | < 0.1% |
| , | 18 | < 0.1% |
| ] | 6 | < 0.1% |
| [ | 6 | < 0.1% |
| ( | 1 | < 0.1% |
| ) | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3713599 | |
| None | 10 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 489124 | |
| n | 386365 | |
| r | 377645 | |
| e | 366016 | |
| g | 358832 | |
| F | 358692 | |
| l | 267441 | |
| o | 243782 | |
| w | 243567 | |
| t | 181837 | 4.9% |
| Other values (33) | 440298 |
None
| Value | Count | Frequency (%) |
| í | 10 |
preparations
Text
Missing 
| Distinct | 117 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 4381212 |
| Missing (%) | 97.0% |
| Memory size | 34.5 MiB |
Length
| Max length | 154 |
|---|---|
| Median length | 142 |
| Mean length | 13.21074906 |
| Min length | 3 |
Unique
| Unique | 35 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Wood Sample |
|---|---|
| 2nd row | Photograph |
| 3rd row | Microslide |
| 4th row | Photograph |
| 5th row | Photograph; Photograph |
| Value | Count | Frequency (%) |
| sample | 42484 | |
| wood | 42481 | |
| microslide | 41833 | |
| photograph | 33523 | |
| individual | 18796 | |
| strewn | 10233 | 4.5% |
| sem | 6926 | 3.1% |
| micrograph | 6518 | 2.9% |
| ink | 5466 | 2.4% |
| and | 3022 | 1.3% |
| Other values (80) | 15061 | 6.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 203882 | 11.5% |
| i | 158041 | 8.9% |
| d | 125573 | 7.1% |
| l | 110697 | 6.2% |
| a | 110082 | 6.2% |
| r | 106206 | 6.0% |
| e | 101621 | 5.7% |
| 91894 | 5.2% | |
| p | 85868 | 4.8% |
| h | 73865 | 4.2% |
| Other values (45) | 608443 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1435215 | |
| Uppercase Letter | 188766 | 10.6% |
| Space Separator | 91894 | 5.2% |
| Open Punctuation | 29029 | 1.6% |
| Close Punctuation | 29029 | 1.6% |
| Other Punctuation | 2234 | 0.1% |
| Decimal Number | 5 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 203882 | |
| i | 158041 | |
| d | 125573 | |
| l | 110697 | 7.7% |
| a | 110082 | 7.7% |
| r | 106206 | 7.4% |
| e | 101621 | 7.1% |
| p | 85868 | 6.0% |
| h | 73865 | 5.1% |
| s | 53207 | 3.7% |
| Other values (16) | 306173 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 49907 | |
| M | 48777 | |
| W | 43261 | |
| P | 33419 | |
| E | 7153 | 3.8% |
| B | 2795 | 1.5% |
| I | 2467 | 1.3% |
| F | 648 | 0.3% |
| D | 226 | 0.1% |
| T | 77 | < 0.1% |
| Other values (9) | 36 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 2 | |
| 5 | 1 | |
| 2 | 1 | |
| 6 | 1 |
Other Punctuation
| Value | Count | Frequency (%) |
| ; | 2198 | |
| , | 35 | 1.6% |
| & | 1 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 91894 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 29029 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 29029 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1623981 | |
| Common | 152191 | 8.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 203882 | |
| i | 158041 | 9.7% |
| d | 125573 | 7.7% |
| l | 110697 | 6.8% |
| a | 110082 | 6.8% |
| r | 106206 | 6.5% |
| e | 101621 | 6.3% |
| p | 85868 | 5.3% |
| h | 73865 | 4.5% |
| s | 53207 | 3.3% |
| Other values (35) | 494939 |
Common
| Value | Count | Frequency (%) |
| 91894 | ||
| ( | 29029 | 19.1% |
| ) | 29029 | 19.1% |
| ; | 2198 | 1.4% |
| , | 35 | < 0.1% |
| 0 | 2 | < 0.1% |
| & | 1 | < 0.1% |
| 5 | 1 | < 0.1% |
| 2 | 1 | < 0.1% |
| 6 | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1776172 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 203882 | 11.5% |
| i | 158041 | 8.9% |
| d | 125573 | 7.1% |
| l | 110697 | 6.2% |
| a | 110082 | 6.2% |
| r | 106206 | 6.0% |
| e | 101621 | 5.7% |
| 91894 | 5.2% | |
| p | 85868 | 4.8% |
| h | 73865 | 4.2% |
| Other values (45) | 608443 |
associatedMedia
Text
Missing 
| Distinct | 4172762 |
|---|---|
| Distinct (%) | 99.4% |
| Missing | 318147 |
| Missing (%) | 7.0% |
| Memory size | 34.5 MiB |
Length
| Max length | 1040 |
|---|---|
| Median length | 49 |
| Mean length | 49.74545934 |
| Min length | 48 |
Unique
| Unique | 4151953 ? |
|---|---|
| Unique (%) | 98.9% |
Sample
| 1st row | https://collections.nmnh.si.edu/media/?i=12410529 |
|---|---|
| 2nd row | https://collections.nmnh.si.edu/media/?i=14440219 |
| 3rd row | https://collections.nmnh.si.edu/media/?i=14306337 |
| 4th row | https://collections.nmnh.si.edu/media/?i=15522674 |
| 5th row | https://collections.nmnh.si.edu/media/?i=15293772 |
| Value | Count | Frequency (%) |
| 16574494 | 50 | < 0.1% |
| https://collections.nmnh.si.edu/media/?i=13965384 | 42 | < 0.1% |
| 16580564 | 35 | < 0.1% |
| 16582219 | 30 | < 0.1% |
| https://collections.nmnh.si.edu/media/?i=15413125 | 25 | < 0.1% |
| https://collections.nmnh.si.edu/media/?i=16645032 | 22 | < 0.1% |
| https://collections.nmnh.si.edu/media/?i=15413478 | 21 | < 0.1% |
| https://collections.nmnh.si.edu/media/?i=10422983 | 19 | < 0.1% |
| https://collections.nmnh.si.edu/media/?i=15921416 | 17 | < 0.1% |
| https://collections.nmnh.si.edu/media/?i=16645082 | 16 | < 0.1% |
| Other values (4474099) | 4510811 |
Most occurring characters
| Value | Count | Frequency (%) |
| / | 16790056 | 8.0% |
| i | 16790056 | 8.0% |
| s | 12592542 | 6.0% |
| e | 12592542 | 6.0% |
| n | 12592542 | 6.0% |
| . | 12592542 | 6.0% |
| t | 12592542 | 6.0% |
| h | 8395028 | 4.0% |
| c | 8395028 | 4.0% |
| o | 8395028 | 4.0% |
| Other values (21) | 87079356 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 130122934 | |
| Other Punctuation | 38091203 | 18.2% |
| Decimal Number | 36082037 | 17.3% |
| Math Symbol | 4197514 | 2.0% |
| Space Separator | 313574 | 0.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 16790056 | |
| s | 12592542 | |
| e | 12592542 | |
| n | 12592542 | |
| t | 12592542 | |
| h | 8395028 | 6.5% |
| c | 8395028 | 6.5% |
| o | 8395028 | 6.5% |
| l | 8395028 | 6.5% |
| m | 8395028 | 6.5% |
| Other values (4) | 20987570 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 8143819 | |
| 2 | 3518459 | |
| 5 | 3514642 | |
| 3 | 3509019 | |
| 4 | 3458224 | |
| 0 | 3078625 | 8.5% |
| 6 | 3018330 | 8.4% |
| 9 | 2670129 | 7.4% |
| 7 | 2613317 | 7.2% |
| 8 | 2557473 | 7.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 16790056 | |
| . | 12592542 | |
| ? | 4197514 | 11.0% |
| : | 4197514 | 11.0% |
| ; | 313577 | 0.8% |
Math Symbol
| Value | Count | Frequency (%) |
| = | 4197514 |
Space Separator
| Value | Count | Frequency (%) |
| 313574 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 130122934 | |
| Common | 78684328 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| / | 16790056 | |
| . | 12592542 | |
| 1 | 8143819 | |
| ? | 4197514 | 5.3% |
| = | 4197514 | 5.3% |
| : | 4197514 | 5.3% |
| 2 | 3518459 | 4.5% |
| 5 | 3514642 | 4.5% |
| 3 | 3509019 | 4.5% |
| 4 | 3458224 | 4.4% |
| Other values (7) | 14565025 |
Latin
| Value | Count | Frequency (%) |
| i | 16790056 | |
| s | 12592542 | |
| e | 12592542 | |
| n | 12592542 | |
| t | 12592542 | |
| h | 8395028 | 6.5% |
| c | 8395028 | 6.5% |
| o | 8395028 | 6.5% |
| l | 8395028 | 6.5% |
| m | 8395028 | 6.5% |
| Other values (4) | 20987570 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 208807262 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| / | 16790056 | 8.0% |
| i | 16790056 | 8.0% |
| s | 12592542 | 6.0% |
| e | 12592542 | 6.0% |
| n | 12592542 | 6.0% |
| . | 12592542 | 6.0% |
| t | 12592542 | 6.0% |
| h | 8395028 | 4.0% |
| c | 8395028 | 4.0% |
| o | 8395028 | 4.0% |
| Other values (21) | 87079356 |
Missing 
| Distinct | 334 |
|---|---|
| Distinct (%) | 95.2% |
| Missing | 4515310 |
| Missing (%) | > 99.9% |
| Memory size | 34.5 MiB |
Length
| Max length | 499 |
|---|---|
| Median length | 249 |
| Mean length | 140.8803419 |
| Min length | 49 |
Unique
| Unique | 317 ? |
|---|---|
| Unique (%) | 90.3% |
Sample
| 1st row | https://www.ncbi.nlm.nih.gov/gquery?term=ON553270 |
|---|---|
| 2nd row | https://www.ncbi.nlm.nih.gov/gquery?term=MT553291 |
| 3rd row | https://www.ncbi.nlm.nih.gov/gquery?term=MT553246 |
| 4th row | https://www.ncbi.nlm.nih.gov/gquery?term=MT553283 |
| 5th row | https://www.ncbi.nlm.nih.gov/gquery?term=EU527211|https://www.ncbi.nlm.nih.gov/gquery?term=EU527308|https://www.ncbi.nlm.nih.gov/gquery?term=EU527261 |
| Value | Count | Frequency (%) |
| https://www.ncbi.nlm.nih.gov/gquery?term=jn837179|https://www.ncbi.nlm.nih.gov/gquery?term=jn837463|https://www.ncbi.nlm.nih.gov/gquery?term=jn837359|https://www.ncbi.nlm.nih.gov/gquery?term=jn837269 | 2 | 0.6% |
| https://www.ncbi.nlm.nih.gov/gquery?term=jn837192|https://www.ncbi.nlm.nih.gov/gquery?term=jn837282|https://www.ncbi.nlm.nih.gov/gquery?term=jn837372|https://www.ncbi.nlm.nih.gov/gquery?term=jn837475 | 2 | 0.6% |
| https://www.ncbi.nlm.nih.gov/gquery?term=eu527212|https://www.ncbi.nlm.nih.gov/gquery?term=eu527309|https://www.ncbi.nlm.nih.gov/gquery?term=eu527262 | 2 | 0.6% |
| https://www.ncbi.nlm.nih.gov/gquery?term=jn837150|https://www.ncbi.nlm.nih.gov/gquery?term=jn837436|https://www.ncbi.nlm.nih.gov/gquery?term=jn837330|https://www.ncbi.nlm.nih.gov/gquery?term=jn837240 | 2 | 0.6% |
| https://www.ncbi.nlm.nih.gov/gquery?term=jn837187|https://www.ncbi.nlm.nih.gov/gquery?term=jn837470|https://www.ncbi.nlm.nih.gov/gquery?term=jn837367|https://www.ncbi.nlm.nih.gov/gquery?term=jn837277 | 2 | 0.6% |
| https://www.ncbi.nlm.nih.gov/gquery?term=kf989555|https://www.ncbi.nlm.nih.gov/gquery?term=kf989872|https://www.ncbi.nlm.nih.gov/gquery?term=kf989774|https://www.ncbi.nlm.nih.gov/gquery?term=kf989974|https://www.ncbi.nlm.nih.gov/gquery?term=kf989663 | 2 | 0.6% |
| https://www.ncbi.nlm.nih.gov/gquery?term=jn837109|https://www.ncbi.nlm.nih.gov/gquery?term=jn837391|https://www.ncbi.nlm.nih.gov/gquery?term=jn837290|https://www.ncbi.nlm.nih.gov/gquery?term=jn837199 | 2 | 0.6% |
| https://www.ncbi.nlm.nih.gov/gquery?term=eu527241|https://www.ncbi.nlm.nih.gov/gquery?term=eu527291 | 2 | 0.6% |
| https://www.ncbi.nlm.nih.gov/gquery?term=jn837183|https://www.ncbi.nlm.nih.gov/gquery?term=jn837467|https://www.ncbi.nlm.nih.gov/gquery?term=jn837363|https://www.ncbi.nlm.nih.gov/gquery?term=jn837273 | 2 | 0.6% |
| https://www.ncbi.nlm.nih.gov/gquery?term=jn837191|https://www.ncbi.nlm.nih.gov/gquery?term=jn837281|https://www.ncbi.nlm.nih.gov/gquery?term=jn837371|https://www.ncbi.nlm.nih.gov/gquery?term=jn837474 | 2 | 0.6% |
| Other values (324) | 331 |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 3984 | 8.1% |
| t | 2988 | 6.0% |
| / | 2988 | 6.0% |
| w | 2988 | 6.0% |
| n | 2988 | 6.0% |
| h | 1992 | 4.0% |
| i | 1992 | 4.0% |
| r | 1992 | 4.0% |
| m | 1992 | 4.0% |
| g | 1992 | 4.0% |
| Other values (40) | 23553 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 30876 | |
| Other Punctuation | 8964 | 18.1% |
| Decimal Number | 5976 | 12.1% |
| Uppercase Letter | 1992 | 4.0% |
| Math Symbol | 1641 | 3.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 2988 | 9.7% |
| w | 2988 | 9.7% |
| n | 2988 | 9.7% |
| h | 1992 | 6.5% |
| i | 1992 | 6.5% |
| r | 1992 | 6.5% |
| m | 1992 | 6.5% |
| g | 1992 | 6.5% |
| e | 1992 | 6.5% |
| u | 996 | 3.2% |
| Other values (9) | 8964 |
Uppercase Letter
| Value | Count | Frequency (%) |
| K | 384 | |
| F | 380 | |
| N | 338 | |
| J | 302 | |
| M | 96 | 4.8% |
| E | 94 | 4.7% |
| U | 94 | 4.7% |
| T | 91 | 4.6% |
| Y | 57 | 2.9% |
| A | 54 | 2.7% |
| Other values (5) | 102 | 5.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 9 | 1075 | |
| 8 | 963 | |
| 3 | 756 | |
| 7 | 724 | |
| 5 | 679 | |
| 2 | 540 | |
| 0 | 349 | 5.8% |
| 4 | 308 | 5.2% |
| 6 | 296 | 5.0% |
| 1 | 286 | 4.8% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 3984 | |
| / | 2988 | |
| ? | 996 | 11.1% |
| : | 996 | 11.1% |
Math Symbol
| Value | Count | Frequency (%) |
| = | 996 | |
| | | 645 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 32868 | |
| Common | 16581 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 2988 | 9.1% |
| w | 2988 | 9.1% |
| n | 2988 | 9.1% |
| h | 1992 | 6.1% |
| i | 1992 | 6.1% |
| r | 1992 | 6.1% |
| m | 1992 | 6.1% |
| g | 1992 | 6.1% |
| e | 1992 | 6.1% |
| u | 996 | 3.0% |
| Other values (24) | 10956 |
Common
| Value | Count | Frequency (%) |
| . | 3984 | |
| / | 2988 | |
| 9 | 1075 | 6.5% |
| = | 996 | 6.0% |
| ? | 996 | 6.0% |
| : | 996 | 6.0% |
| 8 | 963 | 5.8% |
| 3 | 756 | 4.6% |
| 7 | 724 | 4.4% |
| 5 | 679 | 4.1% |
| Other values (6) | 2424 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 49449 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 3984 | 8.1% |
| t | 2988 | 6.0% |
| / | 2988 | 6.0% |
| w | 2988 | 6.0% |
| n | 2988 | 6.0% |
| h | 1992 | 4.0% |
| i | 1992 | 4.0% |
| r | 1992 | 4.0% |
| m | 1992 | 4.0% |
| g | 1992 | 4.0% |
| Other values (40) | 23553 |
Missing 
| Distinct | 28685 |
|---|---|
| Distinct (%) | 31.3% |
| Missing | 4424129 |
| Missing (%) | 98.0% |
| Memory size | 34.5 MiB |
Length
| Max length | 54951 |
|---|---|
| Median length | 2489 |
| Mean length | 77.45450771 |
| Min length | 1 |
Unique
| Unique | 24536 ? |
|---|---|
| Unique (%) | 26.8% |
Sample
| 1st row | Received as: seed |
|---|---|
| 2nd row | Transcribed by digital volunteers |
| 3rd row | BRG |
| 4th row | Transcribed by digital volunteers; Original spelling as annotated and published is "subplebeia". Same (?) taxon re-published in Contr. U.S. Natl. Herb. 17: 46 (1913) with more explicit type citation. Unclear whether Lecidea subplebeia is a later homonym of Lecidea subplebeja Vain. (1890); Lecidea austrocalifornica Zahlbr. published as replacement name but citing Lecidea "subplebeja Nyl. apud Hasse". The latter name is superfluous if the original name is not a later homonym. |
| 5th row | US, NY |
| Value | Count | Frequency (%) |
| by | 38261 | 3.8% |
| transcribed | 30348 | 3.0% |
| digital | 30037 | 2.9% |
| volunteers | 30020 | 2.9% |
| 19663 | 1.9% | |
| of | 17327 | 1.7% |
| us | 14702 | 1.4% |
| as | 13939 | 1.4% |
| and | 12959 | 1.3% |
| the | 12017 | 1.2% |
| Other values (45812) | 799986 |
Most occurring characters
| Value | Count | Frequency (%) |
| 923590 | 13.0% | |
| e | 568137 | 8.0% |
| a | 443985 | 6.3% |
| i | 409730 | 5.8% |
| t | 348936 | 4.9% |
| n | 338603 | 4.8% |
| o | 337481 | 4.8% |
| r | 330807 | 4.7% |
| l | 294956 | 4.2% |
| s | 270029 | 3.8% |
| Other values (133) | 2823312 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 4729210 | |
| Space Separator | 923590 | 13.0% |
| Uppercase Letter | 609062 | 8.6% |
| Other Punctuation | 376914 | 5.3% |
| Decimal Number | 324218 | 4.6% |
| Dash Punctuation | 37930 | 0.5% |
| Open Punctuation | 31537 | 0.4% |
| Close Punctuation | 31514 | 0.4% |
| Control | 22926 | 0.3% |
| Connector Punctuation | 867 | < 0.1% |
| Other values (9) | 1798 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 568137 | |
| a | 443985 | 9.4% |
| i | 409730 | 8.7% |
| t | 348936 | 7.4% |
| n | 338603 | 7.2% |
| o | 337481 | 7.1% |
| r | 330807 | 7.0% |
| l | 294956 | 6.2% |
| s | 270029 | 5.7% |
| c | 199665 | 4.2% |
| Other values (38) | 1186881 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 64247 | 10.5% |
| T | 55830 | 9.2% |
| C | 52060 | 8.5% |
| A | 46853 | 7.7% |
| B | 38326 | 6.3% |
| P | 32445 | 5.3% |
| F | 29814 | 4.9% |
| R | 27321 | 4.5% |
| H | 26318 | 4.3% |
| M | 25872 | 4.2% |
| Other values (21) | 209976 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 162150 | |
| , | 110035 | |
| ; | 39470 | 10.5% |
| : | 28446 | 7.5% |
| " | 18510 | 4.9% |
| & | 7482 | 2.0% |
| ' | 4679 | 1.2% |
| / | 3130 | 0.8% |
| ? | 1562 | 0.4% |
| # | 885 | 0.2% |
| Other values (8) | 565 | 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 71150 | |
| 9 | 44064 | |
| 2 | 35632 | |
| 0 | 33874 | |
| 3 | 26448 | 8.2% |
| 8 | 25545 | 7.9% |
| 4 | 23422 | 7.2% |
| 5 | 22222 | 6.9% |
| 7 | 22144 | 6.8% |
| 6 | 19717 | 6.1% |
Math Symbol
| Value | Count | Frequency (%) |
| = | 639 | |
| + | 137 | 15.9% |
| × | 58 | 6.7% |
| ~ | 12 | 1.4% |
| > | 5 | 0.6% |
| ± | 4 | 0.5% |
| < | 3 | 0.3% |
| ¬ | 1 | 0.1% |
| | | 1 | 0.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 37484 | |
| – | 440 | 1.2% |
| — | 5 | < 0.1% |
| ‒ | 1 | < 0.1% |
Other Symbol
| Value | Count | Frequency (%) |
| ° | 98 | |
| © | 17 | 14.2% |
| ♂ | 4 | 3.3% |
| ® | 1 | 0.8% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 29639 | |
| [ | 1894 | 6.0% |
| { | 4 | < 0.1% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 29625 | |
| ] | 1887 | 6.0% |
| } | 2 | < 0.1% |
Nonspacing Mark
| Value | Count | Frequency (%) |
| ́ | 309 | |
| ̧ | 103 | 20.0% |
| ̀ | 103 | 20.0% |
Control
| Value | Count | Frequency (%) |
| 22806 | ||
| 120 | 0.5% |
Space Separator
| Value | Count | Frequency (%) |
| 923590 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 867 |
Final Punctuation
| Value | Count | Frequency (%) |
| ” | 148 |
Initial Punctuation
| Value | Count | Frequency (%) |
| “ | 142 |
Modifier Symbol
| Value | Count | Frequency (%) |
| ´ | 7 |
Other Letter
| Value | Count | Frequency (%) |
| º | 4 |
Currency Symbol
| Value | Count | Frequency (%) |
| $ | 1 |
Other Number
| Value | Count | Frequency (%) |
| ½ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 5338276 | |
| Common | 1750775 | 24.7% |
| Inherited | 515 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 568137 | 10.6% |
| a | 443985 | 8.3% |
| i | 409730 | 7.7% |
| t | 348936 | 6.5% |
| n | 338603 | 6.3% |
| o | 337481 | 6.3% |
| r | 330807 | 6.2% |
| l | 294956 | 5.5% |
| s | 270029 | 5.1% |
| c | 199665 | 3.7% |
| Other values (70) | 1795947 |
Common
| Value | Count | Frequency (%) |
| 923590 | ||
| . | 162150 | 9.3% |
| , | 110035 | 6.3% |
| 1 | 71150 | 4.1% |
| 9 | 44064 | 2.5% |
| ; | 39470 | 2.3% |
| - | 37484 | 2.1% |
| 2 | 35632 | 2.0% |
| 0 | 33874 | 1.9% |
| ( | 29639 | 1.7% |
| Other values (50) | 263687 | 15.1% |
Inherited
| Value | Count | Frequency (%) |
| ́ | 309 | |
| ̧ | 103 | 20.0% |
| ̀ | 103 | 20.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 7085628 | |
| None | 2622 | < 0.1% |
| Punctuation | 797 | < 0.1% |
| Diacriticals | 515 | < 0.1% |
| Misc Symbols | 4 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 923590 | 13.0% | |
| e | 568137 | 8.0% |
| a | 443985 | 6.3% |
| i | 409730 | 5.8% |
| t | 348936 | 4.9% |
| n | 338603 | 4.8% |
| o | 337481 | 4.8% |
| r | 330807 | 4.7% |
| l | 294956 | 4.2% |
| s | 270029 | 3.8% |
| Other values (85) | 2819374 |
None
| Value | Count | Frequency (%) |
| í | 836 | |
| é | 406 | |
| ñ | 338 | |
| á | 275 | 10.5% |
| ó | 130 | 5.0% |
| ° | 98 | 3.7% |
| ç | 87 | 3.3% |
| è | 79 | 3.0% |
| ü | 69 | 2.6% |
| ö | 61 | 2.3% |
| Other values (27) | 243 | 9.3% |
Punctuation
| Value | Count | Frequency (%) |
| – | 440 | |
| ” | 148 | 18.6% |
| “ | 142 | 17.8% |
| • | 48 | 6.0% |
| … | 13 | 1.6% |
| — | 5 | 0.6% |
| ‒ | 1 | 0.1% |
Diacriticals
| Value | Count | Frequency (%) |
| ́ | 309 | |
| ̧ | 103 | 20.0% |
| ̀ | 103 | 20.0% |
Misc Symbols
| Value | Count | Frequency (%) |
| ♂ | 4 |
organismName
Text
Missing 
| Distinct | 3 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 4515658 |
| Missing (%) | > 99.9% |
| Memory size | 34.5 MiB |
Length
| Max length | 6 |
|---|---|
| Median length | 6 |
| Mean length | 5.666666667 |
| Min length | 5 |
Unique
| Unique | 3 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 3018.0 |
|---|---|
| 2nd row | 300.0 |
| 3rd row | 1580.0 |
| Value | Count | Frequency (%) |
| 3018.0 | 1 | |
| 300.0 | 1 | |
| 1580.0 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 7 | |
| . | 3 | |
| 3 | 2 | 11.8% |
| 1 | 2 | 11.8% |
| 8 | 2 | 11.8% |
| 5 | 1 | 5.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 14 | |
| Other Punctuation | 3 | 17.6% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 7 | |
| 3 | 2 | 14.3% |
| 1 | 2 | 14.3% |
| 8 | 2 | 14.3% |
| 5 | 1 | 7.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 3 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 17 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 7 | |
| . | 3 | |
| 3 | 2 | 11.8% |
| 1 | 2 | 11.8% |
| 8 | 2 | 11.8% |
| 5 | 1 | 5.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 17 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 7 | |
| . | 3 | |
| 3 | 2 | 11.8% |
| 1 | 2 | 11.8% |
| 8 | 2 | 11.8% |
| 5 | 1 | 5.9% |
eventType
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 4515660 |
| Missing (%) | > 99.9% |
| Memory size | 34.5 MiB |
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 5 |
| Min length | 5 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | -7.38 |
|---|
| Value | Count | Frequency (%) |
| 7.38 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| - | 1 | |
| 7 | 1 | |
| . | 1 | |
| 3 | 1 | |
| 8 | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3 | |
| Dash Punctuation | 1 | 20.0% |
| Other Punctuation | 1 | 20.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 7 | 1 | |
| 3 | 1 | |
| 8 | 1 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 5 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| - | 1 | |
| 7 | 1 | |
| . | 1 | |
| 3 | 1 | |
| 8 | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| - | 1 | |
| 7 | 1 | |
| . | 1 | |
| 3 | 1 | |
| 8 | 1 |
fieldNumber
Text
Missing 
| Distinct | 15 |
|---|---|
| Distinct (%) | 5.7% |
| Missing | 4515399 |
| Missing (%) | > 99.9% |
| Memory size | 34.5 MiB |
Length
| Max length | 19 |
|---|---|
| Median length | 9 |
| Mean length | 9.122137405 |
| Min length | 4 |
Unique
| Unique | 8 ? |
|---|---|
| Unique (%) | 3.1% |
Sample
| 1st row | Sample OY |
|---|---|
| 2nd row | Sample OY |
| 3rd row | Sample OY |
| 4th row | Sample OY |
| 5th row | Sample OY |
| Value | Count | Frequency (%) |
| sample | 240 | |
| oy | 240 | |
| koolau | 5 | 1.0% |
| b | 4 | 0.8% |
| a | 4 | 0.8% |
| 259 | 3 | 0.6% |
| l-52 | 3 | 0.6% |
| koolau_784 | 3 | 0.6% |
| 17-v-88-5-n | 2 | 0.4% |
| zeeland | 2 | 0.4% |
| Other values (13) | 15 | 2.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 263 | |
| 259 | ||
| l | 255 | |
| e | 247 | |
| S | 241 | |
| p | 240 | |
| m | 240 | |
| O | 240 | |
| Y | 240 | |
| o | 19 | 0.8% |
| Other values (34) | 146 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1302 | |
| Uppercase Letter | 750 | |
| Space Separator | 259 | 10.8% |
| Decimal Number | 57 | 2.4% |
| Dash Punctuation | 12 | 0.5% |
| Connector Punctuation | 7 | 0.3% |
| Other Punctuation | 3 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 263 | |
| l | 255 | |
| e | 247 | |
| p | 240 | |
| m | 240 | |
| o | 19 | 1.5% |
| u | 9 | 0.7% |
| n | 7 | 0.5% |
| i | 5 | 0.4% |
| b | 4 | 0.3% |
| Other values (7) | 13 | 1.0% |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 241 | |
| O | 240 | |
| Y | 240 | |
| K | 9 | 1.2% |
| B | 4 | 0.5% |
| A | 3 | 0.4% |
| L | 3 | 0.4% |
| V | 3 | 0.4% |
| Z | 2 | 0.3% |
| H | 2 | 0.3% |
| Other values (3) | 3 | 0.4% |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 11 | |
| 5 | 11 | |
| 8 | 10 | |
| 1 | 6 | |
| 7 | 5 | |
| 4 | 5 | |
| 0 | 4 | 7.0% |
| 9 | 3 | 5.3% |
| 3 | 1 | 1.8% |
| 6 | 1 | 1.8% |
Space Separator
| Value | Count | Frequency (%) |
| 259 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 12 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 7 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 3 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2052 | |
| Common | 338 | 14.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 263 | |
| l | 255 | |
| e | 247 | |
| S | 241 | |
| p | 240 | |
| m | 240 | |
| O | 240 | |
| Y | 240 | |
| o | 19 | 0.9% |
| u | 9 | 0.4% |
| Other values (20) | 58 | 2.8% |
Common
| Value | Count | Frequency (%) |
| 259 | ||
| - | 12 | 3.6% |
| 2 | 11 | 3.3% |
| 5 | 11 | 3.3% |
| 8 | 10 | 3.0% |
| _ | 7 | 2.1% |
| 1 | 6 | 1.8% |
| 7 | 5 | 1.5% |
| 4 | 5 | 1.5% |
| 0 | 4 | 1.2% |
| Other values (4) | 8 | 2.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2390 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 263 | |
| 259 | ||
| l | 255 | |
| e | 247 | |
| S | 241 | |
| p | 240 | |
| m | 240 | |
| O | 240 | |
| Y | 240 | |
| o | 19 | 0.8% |
| Other values (34) | 146 |
eventDate
Text
Missing 
| Distinct | 100743 |
|---|---|
| Distinct (%) | 2.5% |
| Missing | 499507 |
| Missing (%) | 11.1% |
| Memory size | 34.5 MiB |
Length
| Max length | 24 |
|---|---|
| Median length | 10 |
| Mean length | 10.22911024 |
| Min length | 4 |
Unique
| Unique | 26184 ? |
|---|---|
| Unique (%) | 0.7% |
Sample
| 1st row | 1981-04-30 |
|---|---|
| 2nd row | 1954-08-07 |
| 3rd row | 1947-04-03 |
| 4th row | 1966-04-01 |
| 5th row | 1971-03-23 |
| Value | Count | Frequency (%) |
| or | 7511 | 0.2% |
| 1838/1842 | 6999 | 0.2% |
| 1891 | 4648 | 0.1% |
| 1760/1808 | 3657 | 0.1% |
| 1889 | 3487 | 0.1% |
| 1875 | 3339 | 0.1% |
| 1853/1856 | 3324 | 0.1% |
| 1890 | 3156 | 0.1% |
| 1923 | 3139 | 0.1% |
| 1887 | 3048 | 0.1% |
| Other values (96839) | 3988868 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 7801445 | |
| - | 7726696 | |
| 0 | 6238271 | |
| 9 | 5176217 | |
| 2 | 3077613 | 7.5% |
| 8 | 2443585 | 5.9% |
| 7 | 1744336 | 4.2% |
| 6 | 1741318 | 4.2% |
| 3 | 1670243 | 4.1% |
| 5 | 1552877 | 3.8% |
| Other values (6) | 1909081 | 4.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 32969564 | |
| Dash Punctuation | 7726696 | 18.8% |
| Other Punctuation | 355378 | 0.9% |
| Space Separator | 15022 | < 0.1% |
| Lowercase Letter | 15022 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 7801445 | |
| 0 | 6238271 | |
| 9 | 5176217 | |
| 2 | 3077613 | 9.3% |
| 8 | 2443585 | 7.4% |
| 7 | 1744336 | 5.3% |
| 6 | 1741318 | 5.3% |
| 3 | 1670243 | 5.1% |
| 5 | 1552877 | 4.7% |
| 4 | 1523659 | 4.6% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 347825 | |
| , | 7553 | 2.1% |
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 7511 | |
| r | 7511 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 7726696 |
Space Separator
| Value | Count | Frequency (%) |
| 15022 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 41066660 | |
| Latin | 15022 | < 0.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 7801445 | |
| - | 7726696 | |
| 0 | 6238271 | |
| 9 | 5176217 | |
| 2 | 3077613 | 7.5% |
| 8 | 2443585 | 6.0% |
| 7 | 1744336 | 4.2% |
| 6 | 1741318 | 4.2% |
| 3 | 1670243 | 4.1% |
| 5 | 1552877 | 3.8% |
| Other values (4) | 1894059 | 4.6% |
Latin
| Value | Count | Frequency (%) |
| o | 7511 | |
| r | 7511 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 41081682 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 7801445 | |
| - | 7726696 | |
| 0 | 6238271 | |
| 9 | 5176217 | |
| 2 | 3077613 | 7.5% |
| 8 | 2443585 | 5.9% |
| 7 | 1744336 | 4.2% |
| 6 | 1741318 | 4.2% |
| 3 | 1670243 | 4.1% |
| 5 | 1552877 | 3.8% |
| Other values (6) | 1909081 | 4.6% |
startDayOfYear
Text
Missing 
| Distinct | 366 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 707937 |
| Missing (%) | 15.7% |
| Memory size | 34.5 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 2.786185921 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 120 |
|---|---|
| 2nd row | 219 |
| 3rd row | 93 |
| 4th row | 91 |
| 5th row | 82 |
| Value | Count | Frequency (%) |
| 212 | 65252 | 1.7% |
| 243 | 53926 | 1.4% |
| 181 | 53248 | 1.4% |
| 151 | 48909 | 1.3% |
| 120 | 37888 | 1.0% |
| 213 | 35186 | 0.9% |
| 273 | 34755 | 0.9% |
| 90 | 31604 | 0.8% |
| 304 | 30708 | 0.8% |
| 244 | 28680 | 0.8% |
| Other values (356) | 3387568 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 2141855 | |
| 2 | 2135625 | |
| 3 | 1271069 | |
| 4 | 814683 | 7.7% |
| 5 | 784204 | 7.4% |
| 0 | 747144 | 7.0% |
| 9 | 692015 | 6.5% |
| 6 | 686000 | 6.5% |
| 8 | 680008 | 6.4% |
| 7 | 656424 | 6.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 10609027 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 2141855 | |
| 2 | 2135625 | |
| 3 | 1271069 | |
| 4 | 814683 | 7.7% |
| 5 | 784204 | 7.4% |
| 0 | 747144 | 7.0% |
| 9 | 692015 | 6.5% |
| 6 | 686000 | 6.5% |
| 8 | 680008 | 6.4% |
| 7 | 656424 | 6.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 10609027 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 2141855 | |
| 2 | 2135625 | |
| 3 | 1271069 | |
| 4 | 814683 | 7.7% |
| 5 | 784204 | 7.4% |
| 0 | 747144 | 7.0% |
| 9 | 692015 | 6.5% |
| 6 | 686000 | 6.5% |
| 8 | 680008 | 6.4% |
| 7 | 656424 | 6.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 10609027 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 2141855 | |
| 2 | 2135625 | |
| 3 | 1271069 | |
| 4 | 814683 | 7.7% |
| 5 | 784204 | 7.4% |
| 0 | 747144 | 7.0% |
| 9 | 692015 | 6.5% |
| 6 | 686000 | 6.5% |
| 8 | 680008 | 6.4% |
| 7 | 656424 | 6.2% |
endDayOfYear
Text
Missing 
| Distinct | 366 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 706297 |
| Missing (%) | 15.6% |
| Memory size | 34.5 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 2.787330116 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 120 |
|---|---|
| 2nd row | 219 |
| 3rd row | 93 |
| 4th row | 91 |
| 5th row | 82 |
| Value | Count | Frequency (%) |
| 212 | 66123 | 1.7% |
| 243 | 57269 | 1.5% |
| 181 | 53207 | 1.4% |
| 151 | 44291 | 1.2% |
| 120 | 37736 | 1.0% |
| 273 | 36145 | 0.9% |
| 90 | 33356 | 0.9% |
| 304 | 33021 | 0.9% |
| 213 | 32570 | 0.9% |
| 244 | 30634 | 0.8% |
| Other values (356) | 3385012 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 2140880 | |
| 1 | 2116002 | |
| 3 | 1288304 | |
| 4 | 826923 | 7.8% |
| 5 | 780021 | 7.3% |
| 0 | 749805 | 7.1% |
| 9 | 689036 | 6.5% |
| 6 | 683421 | 6.4% |
| 8 | 681866 | 6.4% |
| 7 | 661697 | 6.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 10617955 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 2140880 | |
| 1 | 2116002 | |
| 3 | 1288304 | |
| 4 | 826923 | 7.8% |
| 5 | 780021 | 7.3% |
| 0 | 749805 | 7.1% |
| 9 | 689036 | 6.5% |
| 6 | 683421 | 6.4% |
| 8 | 681866 | 6.4% |
| 7 | 661697 | 6.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 10617955 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 2140880 | |
| 1 | 2116002 | |
| 3 | 1288304 | |
| 4 | 826923 | 7.8% |
| 5 | 780021 | 7.3% |
| 0 | 749805 | 7.1% |
| 9 | 689036 | 6.5% |
| 6 | 683421 | 6.4% |
| 8 | 681866 | 6.4% |
| 7 | 661697 | 6.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 10617955 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 2140880 | |
| 1 | 2116002 | |
| 3 | 1288304 | |
| 4 | 826923 | 7.8% |
| 5 | 780021 | 7.3% |
| 0 | 749805 | 7.1% |
| 9 | 689036 | 6.5% |
| 6 | 683421 | 6.4% |
| 8 | 681866 | 6.4% |
| 7 | 661697 | 6.2% |
year
Text
Missing 
| Distinct | 275 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 499507 |
| Missing (%) | 11.1% |
| Memory size | 34.5 MiB |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Unique
| Unique | 14 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 1981 |
|---|---|
| 2nd row | 1954 |
| 3rd row | 1947 |
| 4th row | 1966 |
| 5th row | 1971 |
| Value | Count | Frequency (%) |
| 1966 | 52747 | 1.3% |
| 1964 | 51506 | 1.3% |
| 1939 | 48539 | 1.2% |
| 1949 | 46743 | 1.2% |
| 1929 | 45873 | 1.1% |
| 1938 | 44996 | 1.1% |
| 1965 | 44897 | 1.1% |
| 1922 | 42897 | 1.1% |
| 1962 | 42322 | 1.1% |
| 1968 | 41351 | 1.0% |
| Other values (265) | 3554283 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 4564194 | |
| 9 | 4113450 | |
| 8 | 1419370 | 8.8% |
| 0 | 1047460 | 6.5% |
| 2 | 964799 | 6.0% |
| 6 | 886880 | 5.5% |
| 4 | 787306 | 4.9% |
| 3 | 776965 | 4.8% |
| 7 | 755520 | 4.7% |
| 5 | 748672 | 4.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 16064616 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 4564194 | |
| 9 | 4113450 | |
| 8 | 1419370 | 8.8% |
| 0 | 1047460 | 6.5% |
| 2 | 964799 | 6.0% |
| 6 | 886880 | 5.5% |
| 4 | 787306 | 4.9% |
| 3 | 776965 | 4.8% |
| 7 | 755520 | 4.7% |
| 5 | 748672 | 4.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 16064616 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 4564194 | |
| 9 | 4113450 | |
| 8 | 1419370 | 8.8% |
| 0 | 1047460 | 6.5% |
| 2 | 964799 | 6.0% |
| 6 | 886880 | 5.5% |
| 4 | 787306 | 4.9% |
| 3 | 776965 | 4.8% |
| 7 | 755520 | 4.7% |
| 5 | 748672 | 4.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 16064616 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 4564194 | |
| 9 | 4113450 | |
| 8 | 1419370 | 8.8% |
| 0 | 1047460 | 6.5% |
| 2 | 964799 | 6.0% |
| 6 | 886880 | 5.5% |
| 4 | 787306 | 4.9% |
| 3 | 776965 | 4.8% |
| 7 | 755520 | 4.7% |
| 5 | 748672 | 4.7% |
month
Text
Missing 
| Distinct | 12 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 700051 |
| Missing (%) | 15.5% |
| Memory size | 34.5 MiB |
Length
| Max length | 2 |
|---|---|
| Median length | 1 |
| Mean length | 1.171101868 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 4 |
|---|---|
| 2nd row | 8 |
| 3rd row | 4 |
| 4th row | 4 |
| 5th row | 3 |
| Value | Count | Frequency (%) |
| 7 | 547270 | |
| 8 | 491637 | |
| 6 | 411350 | |
| 5 | 354199 | |
| 9 | 340693 | |
| 4 | 293710 | |
| 3 | 268012 | |
| 10 | 260725 | |
| 2 | 235530 | |
| 1 | 220351 | |
| Other values (2) | 392133 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 1080650 | |
| 7 | 547270 | |
| 8 | 491637 | |
| 2 | 420222 | 9.4% |
| 6 | 411350 | 9.2% |
| 5 | 354199 | 7.9% |
| 9 | 340693 | 7.6% |
| 4 | 293710 | 6.6% |
| 3 | 268012 | 6.0% |
| 0 | 260725 | 5.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 4468468 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 1080650 | |
| 7 | 547270 | |
| 8 | 491637 | |
| 2 | 420222 | 9.4% |
| 6 | 411350 | 9.2% |
| 5 | 354199 | 7.9% |
| 9 | 340693 | 7.6% |
| 4 | 293710 | 6.6% |
| 3 | 268012 | 6.0% |
| 0 | 260725 | 5.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 4468468 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 1080650 | |
| 7 | 547270 | |
| 8 | 491637 | |
| 2 | 420222 | 9.4% |
| 6 | 411350 | 9.2% |
| 5 | 354199 | 7.9% |
| 9 | 340693 | 7.6% |
| 4 | 293710 | 6.6% |
| 3 | 268012 | 6.0% |
| 0 | 260725 | 5.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4468468 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 1080650 | |
| 7 | 547270 | |
| 8 | 491637 | |
| 2 | 420222 | 9.4% |
| 6 | 411350 | 9.2% |
| 5 | 354199 | 7.9% |
| 9 | 340693 | 7.6% |
| 4 | 293710 | 6.6% |
| 3 | 268012 | 6.0% |
| 0 | 260725 | 5.8% |
day
Text
Missing 
| Distinct | 31 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1180026 |
| Missing (%) | 26.1% |
| Memory size | 34.5 MiB |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 1.709506586 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 30 |
|---|---|
| 2nd row | 7 |
| 3rd row | 3 |
| 4th row | 1 |
| 5th row | 23 |
| Value | Count | Frequency (%) |
| 20 | 124820 | 3.7% |
| 15 | 121526 | 3.6% |
| 10 | 116402 | 3.5% |
| 1 | 116290 | 3.5% |
| 18 | 114533 | 3.4% |
| 25 | 112462 | 3.4% |
| 19 | 112076 | 3.4% |
| 17 | 111731 | 3.3% |
| 12 | 111029 | 3.3% |
| 8 | 109940 | 3.3% |
| Other values (21) | 2184826 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 1506435 | |
| 2 | 1419764 | |
| 3 | 471332 | 8.3% |
| 5 | 342700 | 6.0% |
| 0 | 340158 | 6.0% |
| 8 | 333415 | 5.8% |
| 7 | 328337 | 5.8% |
| 6 | 325244 | 5.7% |
| 4 | 321679 | 5.6% |
| 9 | 313226 | 5.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 5702290 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 1506435 | |
| 2 | 1419764 | |
| 3 | 471332 | 8.3% |
| 5 | 342700 | 6.0% |
| 0 | 340158 | 6.0% |
| 8 | 333415 | 5.8% |
| 7 | 328337 | 5.8% |
| 6 | 325244 | 5.7% |
| 4 | 321679 | 5.6% |
| 9 | 313226 | 5.5% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 5702290 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 1506435 | |
| 2 | 1419764 | |
| 3 | 471332 | 8.3% |
| 5 | 342700 | 6.0% |
| 0 | 340158 | 6.0% |
| 8 | 333415 | 5.8% |
| 7 | 328337 | 5.8% |
| 6 | 325244 | 5.7% |
| 4 | 321679 | 5.6% |
| 9 | 313226 | 5.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5702290 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 1506435 | |
| 2 | 1419764 | |
| 3 | 471332 | 8.3% |
| 5 | 342700 | 6.0% |
| 0 | 340158 | 6.0% |
| 8 | 333415 | 5.8% |
| 7 | 328337 | 5.8% |
| 6 | 325244 | 5.7% |
| 4 | 321679 | 5.6% |
| 9 | 313226 | 5.5% |
Missing 
| Distinct | 144603 |
|---|---|
| Distinct (%) | 9.5% |
| Missing | 2995056 |
| Missing (%) | 66.3% |
| Memory size | 34.5 MiB |
Length
| Max length | 44726 |
|---|---|
| Median length | 11 |
| Mean length | 13.4161199 |
| Min length | 1 |
Unique
| Unique | 47164 ? |
|---|---|
| Unique (%) | 3.1% |
Sample
| 1st row | 30 Apr 1981 |
|---|---|
| 2nd row | 16 Dec 1953 |
| 3rd row | -- --- ---- |
| 4th row | 01 Feb 1974 |
| 5th row | Transcribed d/m/y: 28/4/76 |
| Value | Count | Frequency (%) |
| 569685 | 12.1% | |
| transcribed | 163800 | 3.5% |
| d/m/y | 163799 | 3.5% |
| jul | 133516 | 2.8% |
| aug | 127314 | 2.7% |
| may | 100704 | 2.1% |
| sep | 100407 | 2.1% |
| jun | 99914 | 2.1% |
| mar | 89911 | 1.9% |
| to | 89366 | 1.9% |
| Other values (50980) | 3069137 |
Most occurring characters
| Value | Count | Frequency (%) |
| 3184972 | ||
| 1 | 2057978 | 10.1% |
| - | 1706716 | 8.4% |
| 9 | 1491498 | 7.3% |
| 2 | 903531 | 4.4% |
| 0 | 760159 | 3.7% |
| / | 668632 | 3.3% |
| 8 | 665330 | 3.3% |
| r | 587998 | 2.9% |
| e | 493324 | 2.4% |
| Other values (100) | 7880481 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 8113118 | |
| Lowercase Letter | 5099499 | |
| Space Separator | 3184972 | 15.6% |
| Dash Punctuation | 1706716 | 8.4% |
| Uppercase Letter | 1427985 | 7.0% |
| Other Punctuation | 856153 | 4.2% |
| Control | 11040 | 0.1% |
| Open Punctuation | 519 | < 0.1% |
| Close Punctuation | 515 | < 0.1% |
| Math Symbol | 76 | < 0.1% |
| Other values (6) | 26 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 587998 | |
| e | 493324 | 9.7% |
| a | 480500 | 9.4% |
| u | 422327 | 8.3% |
| n | 369730 | 7.3% |
| d | 335298 | 6.6% |
| c | 329419 | 6.5% |
| y | 293646 | 5.8% |
| b | 281590 | 5.5% |
| p | 221434 | 4.3% |
| Other values (31) | 1284233 |
Uppercase Letter
| Value | Count | Frequency (%) |
| J | 346402 | |
| A | 244422 | |
| M | 212179 | |
| T | 165091 | |
| S | 118257 | 8.3% |
| F | 97247 | 6.8% |
| O | 92390 | 6.5% |
| N | 75920 | 5.3% |
| D | 66935 | 4.7% |
| I | 3031 | 0.2% |
| Other values (20) | 6111 | 0.4% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 668632 | |
| : | 164474 | 19.2% |
| , | 11572 | 1.4% |
| . | 9358 | 1.1% |
| ' | 713 | 0.1% |
| ? | 654 | 0.1% |
| ! | 466 | 0.1% |
| ; | 146 | < 0.1% |
| & | 67 | < 0.1% |
| * | 53 | < 0.1% |
| Other values (2) | 18 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 2057978 | |
| 9 | 1491498 | |
| 2 | 903531 | |
| 0 | 760159 | 9.4% |
| 8 | 665330 | 8.2% |
| 6 | 485773 | 6.0% |
| 3 | 469301 | 5.8% |
| 4 | 428035 | 5.3% |
| 7 | 426717 | 5.3% |
| 5 | 424796 | 5.2% |
Math Symbol
| Value | Count | Frequency (%) |
| = | 69 | |
| + | 4 | 5.3% |
| ± | 3 | 3.9% |
Control
| Value | Count | Frequency (%) |
| 10982 | ||
| 58 | 0.5% |
Open Punctuation
| Value | Count | Frequency (%) |
| [ | 269 | |
| ( | 250 |
Close Punctuation
| Value | Count | Frequency (%) |
| ] | 265 | |
| ) | 250 |
Space Separator
| Value | Count | Frequency (%) |
| 3184972 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1706716 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 18 |
Other Number
| Value | Count | Frequency (%) |
| ½ | 3 |
Other Symbol
| Value | Count | Frequency (%) |
| ° | 2 |
Other Letter
| Value | Count | Frequency (%) |
| º | 1 |
Modifier Letter
| Value | Count | Frequency (%) |
| ᵉ | 1 |
Final Punctuation
| Value | Count | Frequency (%) |
| ” | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 13873133 | |
| Latin | 6527486 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| r | 587998 | 9.0% |
| e | 493324 | 7.6% |
| a | 480500 | 7.4% |
| u | 422327 | 6.5% |
| n | 369730 | 5.7% |
| J | 346402 | 5.3% |
| d | 335298 | 5.1% |
| c | 329419 | 5.0% |
| y | 293646 | 4.5% |
| b | 281590 | 4.3% |
| Other values (63) | 2587252 |
Common
| Value | Count | Frequency (%) |
| 3184972 | ||
| 1 | 2057978 | |
| - | 1706716 | |
| 9 | 1491498 | |
| 2 | 903531 | 6.5% |
| 0 | 760159 | 5.5% |
| / | 668632 | 4.8% |
| 8 | 665330 | 4.8% |
| 6 | 485773 | 3.5% |
| 3 | 469301 | 3.4% |
| Other values (27) | 1479243 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 20400500 | |
| None | 117 | < 0.1% |
| Phonetic Ext | 1 | < 0.1% |
| Punctuation | 1 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 3184972 | ||
| 1 | 2057978 | 10.1% |
| - | 1706716 | 8.4% |
| 9 | 1491498 | 7.3% |
| 2 | 903531 | 4.4% |
| 0 | 760159 | 3.7% |
| / | 668632 | 3.3% |
| 8 | 665330 | 3.3% |
| r | 587998 | 2.9% |
| e | 493324 | 2.4% |
| Other values (75) | 7880362 |
None
| Value | Count | Frequency (%) |
| é | 41 | |
| û | 16 | 13.7% |
| Æ | 8 | 6.8% |
| ó | 8 | 6.8% |
| ü | 7 | 6.0% |
| á | 5 | 4.3% |
| í | 5 | 4.3% |
| ô | 4 | 3.4% |
| ½ | 3 | 2.6% |
| ± | 3 | 2.6% |
| Other values (13) | 17 |
Phonetic Ext
| Value | Count | Frequency (%) |
| ᵉ | 1 |
Punctuation
| Value | Count | Frequency (%) |
| ” | 1 |
habitat
Text
Missing 
| Distinct | 179316 |
|---|---|
| Distinct (%) | 35.4% |
| Missing | 4009333 |
| Missing (%) | 88.8% |
| Memory size | 34.5 MiB |
Length
| Max length | 98062 |
|---|---|
| Median length | 506 |
| Mean length | 33.74884265 |
| Min length | 1 |
Unique
| Unique | 136896 ? |
|---|---|
| Unique (%) | 27.0% |
Sample
| 1st row | Erect. |
|---|---|
| 2nd row | Planted |
| 3rd row | Hillsides covered with broad-leaved forest, understory with Arthrostylidium, Rubus, and numerous ferns, epiphytes and Usnea. |
| 4th row | Open to closed forest with Pinus contorta, Populus tremuloides, Purshia tridentata, and Ribes cereum. |
| 5th row | Deep secondary forest; clay soil |
| Value | Count | Frequency (%) |
| forest | 131247 | 5.0% |
| on | 90052 | 3.4% |
| and | 73888 | 2.8% |
| in | 67428 | 2.6% |
| with | 54107 | 2.1% |
| of | 49767 | 1.9% |
| along | 29262 | 1.1% |
| de | 27744 | 1.1% |
| soil | 24644 | 0.9% |
| slopes | 22332 | 0.9% |
| Other values (44622) | 2044337 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2101244 | 12.3% | |
| e | 1541379 | 9.0% |
| a | 1348535 | 7.9% |
| o | 1227275 | 7.2% |
| r | 1069847 | 6.3% |
| s | 1066411 | 6.2% |
| n | 1051888 | 6.2% |
| i | 901575 | 5.3% |
| t | 853198 | 5.0% |
| l | 666293 | 3.9% |
| Other values (135) | 5260339 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 13594397 | |
| Space Separator | 2101244 | 12.3% |
| Uppercase Letter | 746465 | 4.4% |
| Other Punctuation | 472867 | 2.8% |
| Decimal Number | 67702 | 0.4% |
| Dash Punctuation | 40071 | 0.2% |
| Control | 39873 | 0.2% |
| Close Punctuation | 10898 | 0.1% |
| Open Punctuation | 10825 | 0.1% |
| Math Symbol | 3419 | < 0.1% |
| Other values (8) | 223 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1541379 | |
| a | 1348535 | |
| o | 1227275 | 9.0% |
| r | 1069847 | 7.9% |
| s | 1066411 | 7.8% |
| n | 1051888 | 7.7% |
| i | 901575 | 6.6% |
| t | 853198 | 6.3% |
| l | 666293 | 4.9% |
| d | 655799 | 4.8% |
| Other values (47) | 3212197 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 91327 | |
| M | 68847 | 9.2% |
| C | 52021 | 7.0% |
| P | 49180 | 6.6% |
| O | 46573 | 6.2% |
| A | 46219 | 6.2% |
| R | 44856 | 6.0% |
| D | 41895 | 5.6% |
| B | 40944 | 5.5% |
| F | 39683 | 5.3% |
| Other values (21) | 224920 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 205238 | |
| . | 204309 | |
| ; | 28753 | 6.1% |
| & | 10542 | 2.2% |
| / | 7989 | 1.7% |
| : | 7982 | 1.7% |
| " | 3600 | 0.8% |
| ' | 2231 | 0.5% |
| ? | 964 | 0.2% |
| % | 677 | 0.1% |
| Other values (6) | 582 | 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 16184 | |
| 1 | 9224 | |
| 3 | 8235 | |
| 2 | 7849 | |
| 5 | 7153 | |
| 4 | 5839 | 8.6% |
| 6 | 4326 | 6.4% |
| 8 | 3600 | 5.3% |
| 9 | 2857 | 4.2% |
| 7 | 2435 | 3.6% |
Math Symbol
| Value | Count | Frequency (%) |
| ~ | 1916 | |
| | | 579 | 16.9% |
| + | 396 | 11.6% |
| = | 268 | 7.8% |
| ± | 193 | 5.6% |
| > | 35 | 1.0% |
| < | 30 | 0.9% |
| ≤ | 1 | < 0.1% |
| × | 1 | < 0.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 40030 | |
| – | 27 | 0.1% |
| — | 14 | < 0.1% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 10326 | |
| ] | 356 | 3.3% |
| } | 216 | 2.0% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 10271 | |
| [ | 337 | 3.1% |
| { | 217 | 2.0% |
Control
| Value | Count | Frequency (%) |
| 39663 | ||
| 210 | 0.5% |
Other Symbol
| Value | Count | Frequency (%) |
| ° | 116 | |
| ¦ | 1 | 0.9% |
Modifier Symbol
| Value | Count | Frequency (%) |
| ^ | 2 | |
| ´ | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 2101244 |
Final Punctuation
| Value | Count | Frequency (%) |
| ” | 31 |
Other Letter
| Value | Count | Frequency (%) |
| º | 25 |
Initial Punctuation
| Value | Count | Frequency (%) |
| “ | 22 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 20 |
Other Number
| Value | Count | Frequency (%) |
| ² | 3 |
Currency Symbol
| Value | Count | Frequency (%) |
| £ | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 14340887 | |
| Common | 2747097 | 16.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 1541379 | 10.7% |
| a | 1348535 | 9.4% |
| o | 1227275 | 8.6% |
| r | 1069847 | 7.5% |
| s | 1066411 | 7.4% |
| n | 1051888 | 7.3% |
| i | 901575 | 6.3% |
| t | 853198 | 5.9% |
| l | 666293 | 4.6% |
| d | 655799 | 4.6% |
| Other values (79) | 3958687 |
Common
| Value | Count | Frequency (%) |
| 2101244 | ||
| , | 205238 | 7.5% |
| . | 204309 | 7.4% |
| - | 40030 | 1.5% |
| 39663 | 1.4% | |
| ; | 28753 | 1.0% |
| 0 | 16184 | 0.6% |
| & | 10542 | 0.4% |
| ) | 10326 | 0.4% |
| ( | 10271 | 0.4% |
| Other values (46) | 80537 | 2.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 17056686 | |
| None | 31039 | 0.2% |
| Punctuation | 258 | < 0.1% |
| Math Operators | 1 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2101244 | ||
| e | 1541379 | 9.0% |
| a | 1348535 | 7.9% |
| o | 1227275 | 7.2% |
| r | 1069847 | 6.3% |
| s | 1066411 | 6.3% |
| n | 1051888 | 6.2% |
| i | 901575 | 5.3% |
| t | 853198 | 5.0% |
| l | 666293 | 3.9% |
| Other values (85) | 5229041 |
None
| Value | Count | Frequency (%) |
| ú | 4630 | |
| é | 4538 | |
| ê | 4416 | |
| ó | 4352 | |
| í | 3568 | |
| á | 3212 | |
| ñ | 2452 | |
| è | 1544 | 5.0% |
| à | 606 | 2.0% |
| ã | 253 | 0.8% |
| Other values (34) | 1468 | 4.7% |
Punctuation
| Value | Count | Frequency (%) |
| … | 164 | |
| ” | 31 | 12.0% |
| – | 27 | 10.5% |
| “ | 22 | 8.5% |
| — | 14 | 5.4% |
Math Operators
| Value | Count | Frequency (%) |
| ≤ | 1 |
samplingProtocol
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 4515660 |
| Missing (%) | > 99.9% |
| Memory size | 34.5 MiB |
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 5 |
| Min length | 5 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 400.0 |
|---|
| Value | Count | Frequency (%) |
| 400.0 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 3 | |
| 4 | 1 | 20.0% |
| . | 1 | 20.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 4 | |
| Other Punctuation | 1 | 20.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 3 | |
| 4 | 1 | 25.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 5 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 3 | |
| 4 | 1 | 20.0% |
| . | 1 | 20.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 3 | |
| 4 | 1 | 20.0% |
| . | 1 | 20.0% |
sampleSizeValue
Text
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 4515659 |
| Missing (%) | > 99.9% |
| Memory size | 34.5 MiB |
Length
| Max length | 6 |
|---|---|
| Median length | 5.5 |
| Mean length | 5.5 |
| Min length | 5 |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 500.0 |
|---|---|
| 2nd row | 1000.0 |
| Value | Count | Frequency (%) |
| 500.0 | 1 | |
| 1000.0 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 7 | |
| . | 2 | 18.2% |
| 5 | 1 | 9.1% |
| 1 | 1 | 9.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 9 | |
| Other Punctuation | 2 | 18.2% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 7 | |
| 5 | 1 | 11.1% |
| 1 | 1 | 11.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 11 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 7 | |
| . | 2 | 18.2% |
| 5 | 1 | 9.1% |
| 1 | 1 | 9.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 11 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 7 | |
| . | 2 | 18.2% |
| 5 | 1 | 9.1% |
| 1 | 1 | 9.1% |
locationID
Text
Missing 
| Distinct | 1108 |
|---|---|
| Distinct (%) | 2.7% |
| Missing | 4473993 |
| Missing (%) | 99.1% |
| Memory size | 34.5 MiB |
Length
| Max length | 37 |
|---|---|
| Median length | 5 |
| Mean length | 5.995968129 |
| Min length | 1 |
Unique
| Unique | 362 ? |
|---|---|
| Unique (%) | 0.9% |
Sample
| 1st row | 66-10 |
|---|---|
| 2nd row | 69-11 |
| 3rd row | 64-51 |
| 4th row | 66-14 |
| 5th row | 64-34 |
| Value | Count | Frequency (%) |
| station | 4948 | 10.3% |
| ms04 | 1735 | 3.6% |
| 66-24 | 1381 | 2.9% |
| 61 | 946 | 2.0% |
| 64-48 | 628 | 1.3% |
| 64-47 | 588 | 1.2% |
| 69-14 | 562 | 1.2% |
| 66-39 | 462 | 1.0% |
| 66-28 | 442 | 0.9% |
| 66-17 | 426 | 0.9% |
| Other values (1007) | 36062 |
Most occurring characters
| Value | Count | Frequency (%) |
| 6 | 43817 | |
| - | 36322 | |
| 4 | 21703 | 8.7% |
| 2 | 20165 | 8.1% |
| 1 | 18260 | 7.3% |
| 0 | 15408 | 6.2% |
| 3 | 11182 | 4.5% |
| 7 | 10459 | 4.2% |
| t | 10132 | 4.1% |
| 8 | 7703 | 3.1% |
| Other values (62) | 54689 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 160264 | |
| Dash Punctuation | 36322 | 14.5% |
| Lowercase Letter | 31510 | 12.6% |
| Uppercase Letter | 14701 | 5.9% |
| Space Separator | 6512 | 2.6% |
| Connector Punctuation | 289 | 0.1% |
| Close Punctuation | 104 | < 0.1% |
| Open Punctuation | 104 | < 0.1% |
| Other Punctuation | 33 | < 0.1% |
| Math Symbol | 1 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 7597 | |
| M | 1888 | 12.8% |
| A | 1171 | 8.0% |
| I | 758 | 5.2% |
| K | 734 | 5.0% |
| N | 501 | 3.4% |
| T | 368 | 2.5% |
| H | 276 | 1.9% |
| O | 207 | 1.4% |
| B | 205 | 1.4% |
| Other values (16) | 996 | 6.8% |
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 10132 | |
| o | 5161 | |
| n | 5158 | |
| i | 5089 | |
| a | 5076 | |
| e | 241 | 0.8% |
| r | 113 | 0.4% |
| l | 104 | 0.3% |
| d | 82 | 0.3% |
| s | 80 | 0.3% |
| Other values (12) | 274 | 0.9% |
Decimal Number
| Value | Count | Frequency (%) |
| 6 | 43817 | |
| 4 | 21703 | |
| 2 | 20165 | |
| 1 | 18260 | |
| 0 | 15408 | 9.6% |
| 3 | 11182 | 7.0% |
| 7 | 10459 | 6.5% |
| 8 | 7703 | 4.8% |
| 9 | 5892 | 3.7% |
| 5 | 5675 | 3.5% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 14 | |
| / | 9 | |
| & | 4 | 12.1% |
| . | 4 | 12.1% |
| ? | 1 | 3.0% |
| \ | 1 | 3.0% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 102 | |
| ] | 2 | 1.9% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 102 | |
| [ | 2 | 1.9% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 36322 |
Space Separator
| Value | Count | Frequency (%) |
| 6512 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 289 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 203629 | |
| Latin | 46211 | 18.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 10132 | |
| S | 7597 | |
| o | 5161 | |
| n | 5158 | |
| i | 5089 | |
| a | 5076 | |
| M | 1888 | 4.1% |
| A | 1171 | 2.5% |
| I | 758 | 1.6% |
| K | 734 | 1.6% |
| Other values (38) | 3447 | 7.5% |
Common
| Value | Count | Frequency (%) |
| 6 | 43817 | |
| - | 36322 | |
| 4 | 21703 | |
| 2 | 20165 | |
| 1 | 18260 | |
| 0 | 15408 | 7.6% |
| 3 | 11182 | 5.5% |
| 7 | 10459 | 5.1% |
| 8 | 7703 | 3.8% |
| 6512 | 3.2% | |
| Other values (14) | 12098 | 5.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 249840 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 6 | 43817 | |
| - | 36322 | |
| 4 | 21703 | 8.7% |
| 2 | 20165 | 8.1% |
| 1 | 18260 | 7.3% |
| 0 | 15408 | 6.2% |
| 3 | 11182 | 4.5% |
| 7 | 10459 | 4.2% |
| t | 10132 | 4.1% |
| 8 | 7703 | 3.1% |
| Other values (62) | 54689 |
higherGeography
Text
| Distinct | 29227 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 38628 |
| Missing (%) | 0.9% |
| Memory size | 34.5 MiB |
Length
| Max length | 136 |
|---|---|
| Median length | 118 |
| Mean length | 40.94742545 |
| Min length | 4 |
Unique
| Unique | 9052 ? |
|---|---|
| Unique (%) | 0.2% |
Sample
| 1st row | North America, United States, Florida |
|---|---|
| 2nd row | South America - Neotropics, Peru, Piura |
| 3rd row | South America, Argentina, Formosa |
| 4th row | South America - Neotropics, Venezuela, Carabobo |
| 5th row | Africa, South Africa |
| Value | Count | Frequency (%) |
| america | 3037857 | 12.5% |
| north | 1751165 | 7.2% |
| 1668629 | 6.8% | |
| neotropics | 1604866 | 6.6% |
| united | 1351633 | 5.5% |
| states | 1342475 | 5.5% |
| south | 1161951 | 4.8% |
| mexico | 330187 | 1.4% |
| asia-tropical | 303950 | 1.2% |
| brazil | 299921 | 1.2% |
| Other values (15123) | 11516200 |
Most occurring characters
| Value | Count | Frequency (%) |
| 19891801 | 10.9% | |
| a | 16902477 | 9.2% |
| i | 13659346 | 7.5% |
| e | 13427066 | 7.3% |
| r | 11621631 | 6.3% |
| t | 11453417 | 6.2% |
| o | 11169195 | 6.1% |
| , | 9361908 | 5.1% |
| n | 7219729 | 3.9% |
| c | 7113809 | 3.9% |
| Other values (153) | 61502596 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 128373886 | |
| Uppercase Letter | 23155087 | 12.6% |
| Space Separator | 19891801 | 10.9% |
| Other Punctuation | 9561674 | 5.2% |
| Dash Punctuation | 2254209 | 1.2% |
| Close Punctuation | 42797 | < 0.1% |
| Open Punctuation | 42786 | < 0.1% |
| Modifier Letter | 501 | < 0.1% |
| Modifier Symbol | 124 | < 0.1% |
| Decimal Number | 105 | < 0.1% |
| Other values (2) | 5 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 16902477 | |
| i | 13659346 | |
| e | 13427066 | |
| r | 11621631 | |
| t | 11453417 | |
| o | 11169195 | |
| n | 7219729 | 5.6% |
| c | 7113809 | 5.5% |
| s | 6918837 | 5.4% |
| m | 4346665 | 3.4% |
| Other values (72) | 24541714 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 4555726 | |
| N | 3785910 | |
| S | 3179005 | |
| C | 1820326 | 7.9% |
| U | 1509030 | 6.5% |
| M | 989360 | 4.3% |
| P | 911762 | 3.9% |
| I | 836682 | 3.6% |
| T | 836579 | 3.6% |
| B | 695897 | 3.0% |
| Other values (38) | 4034810 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 9361908 | |
| . | 135896 | 1.4% |
| ' | 40694 | 0.4% |
| / | 21497 | 0.2% |
| ? | 1508 | < 0.1% |
| & | 139 | < 0.1% |
| " | 14 | < 0.1% |
| ; | 9 | < 0.1% |
| \ | 4 | < 0.1% |
| : | 3 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 46 | |
| 2 | 46 | |
| 3 | 5 | 4.8% |
| 6 | 3 | 2.9% |
| 4 | 2 | 1.9% |
| 7 | 2 | 1.9% |
| 9 | 1 | 1.0% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 26456 | |
| ] | 16340 | |
| } | 1 | < 0.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2254206 | |
| – | 3 | < 0.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 26444 | |
| [ | 16342 |
Modifier Letter
| Value | Count | Frequency (%) |
| ʻ | 442 | |
| ʼ | 59 | 11.8% |
Modifier Symbol
| Value | Count | Frequency (%) |
| ´ | 123 | |
| ¸ | 1 | 0.8% |
Math Symbol
| Value | Count | Frequency (%) |
| = | 1 | |
| + | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 19891801 |
Nonspacing Mark
| Value | Count | Frequency (%) |
| ́ | 3 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 151528973 | |
| Common | 31793999 | 17.3% |
| Inherited | 3 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 16902477 | 11.2% |
| i | 13659346 | 9.0% |
| e | 13427066 | 8.9% |
| r | 11621631 | 7.7% |
| t | 11453417 | 7.6% |
| o | 11169195 | 7.4% |
| n | 7219729 | 4.8% |
| c | 7113809 | 4.7% |
| s | 6918837 | 4.6% |
| A | 4555726 | 3.0% |
| Other values (120) | 47487740 |
Common
| Value | Count | Frequency (%) |
| 19891801 | ||
| , | 9361908 | |
| - | 2254206 | 7.1% |
| . | 135896 | 0.4% |
| ' | 40694 | 0.1% |
| ) | 26456 | 0.1% |
| ( | 26444 | 0.1% |
| / | 21497 | 0.1% |
| [ | 16342 | 0.1% |
| ] | 16340 | 0.1% |
| Other values (22) | 2415 | < 0.1% |
Inherited
| Value | Count | Frequency (%) |
| ́ | 3 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 182844363 | |
| None | 478071 | 0.3% |
| Modifier Letters | 501 | < 0.1% |
| Latin Ext Additional | 34 | < 0.1% |
| Punctuation | 3 | < 0.1% |
| Diacriticals | 3 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 19891801 | 10.9% | |
| a | 16902477 | 9.2% |
| i | 13659346 | 7.5% |
| e | 13427066 | 7.3% |
| r | 11621631 | 6.4% |
| t | 11453417 | 6.3% |
| o | 11169195 | 6.1% |
| , | 9361908 | 5.1% |
| n | 7219729 | 3.9% |
| c | 7113809 | 3.9% |
| Other values (68) | 61023984 |
None
| Value | Count | Frequency (%) |
| á | 158373 | |
| í | 91518 | |
| é | 80944 | |
| ó | 57377 | 12.0% |
| ã | 28755 | 6.0% |
| ô | 13770 | 2.9% |
| ç | 8254 | 1.7% |
| ñ | 7290 | 1.5% |
| Î | 6683 | 1.4% |
| ü | 5257 | 1.1% |
| Other values (59) | 19850 | 4.2% |
Modifier Letters
| Value | Count | Frequency (%) |
| ʻ | 442 | |
| ʼ | 59 | 11.8% |
Latin Ext Additional
| Value | Count | Frequency (%) |
| ḍ | 10 | |
| ṭ | 8 | |
| ồ | 3 | 8.8% |
| ị | 2 | 5.9% |
| ế | 2 | 5.9% |
| ả | 2 | 5.9% |
| ộ | 2 | 5.9% |
| ẵ | 1 | 2.9% |
| ḑ | 1 | 2.9% |
| ừ | 1 | 2.9% |
| Other values (2) | 2 | 5.9% |
Punctuation
| Value | Count | Frequency (%) |
| – | 3 |
Diacriticals
| Value | Count | Frequency (%) |
| ́ | 3 |
continent
Text
Missing 
| Distinct | 72 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 66158 |
| Missing (%) | 1.5% |
| Memory size | 34.5 MiB |
Length
| Max length | 57 |
|---|---|
| Median length | 50 |
| Mean length | 17.22849923 |
| Min length | 4 |
Unique
| Unique | 17 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | North America |
|---|---|
| 2nd row | South America - Neotropics |
| 3rd row | South America |
| 4th row | South America - Neotropics |
| 5th row | Africa |
| Value | Count | Frequency (%) |
| america | 3037856 | |
| north | 1705271 | |
| 1605982 | ||
| neotropics | 1604866 | |
| south | 1078261 | 9.7% |
| asia-tropical | 303950 | 2.7% |
| central | 269843 | 2.4% |
| west | 265195 | 2.4% |
| indies | 265195 | 2.4% |
| europe | 230947 | 2.1% |
| Other values (19) | 796938 | 7.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| r | 7607800 | 9.9% |
| 6714801 | 8.8% | |
| o | 6532692 | 8.5% |
| e | 6336991 | 8.3% |
| i | 6293342 | 8.2% |
| c | 5457494 | 7.1% |
| t | 5250425 | 6.8% |
| a | 5072337 | 6.6% |
| A | 3811578 | 5.0% |
| N | 3310135 | 4.3% |
| Other values (30) | 20270664 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 57722814 | |
| Uppercase Letter | 10079210 | 13.1% |
| Space Separator | 6714801 | 8.8% |
| Dash Punctuation | 2124592 | 2.8% |
| Other Punctuation | 16840 | < 0.1% |
| Open Punctuation | 1 | < 0.1% |
| Close Punctuation | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 7607800 | |
| o | 6532692 | |
| e | 6336991 | |
| i | 6293342 | |
| c | 5457494 | |
| t | 5250425 | |
| a | 5072337 | |
| m | 3253669 | |
| s | 3104060 | |
| h | 2783537 | 4.8% |
| Other values (10) | 6030467 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 3811578 | |
| N | 3310135 | |
| S | 1078416 | 10.7% |
| T | 519589 | 5.2% |
| I | 418416 | 4.2% |
| C | 269978 | 2.7% |
| W | 265195 | 2.6% |
| E | 230947 | 2.3% |
| P | 154152 | 1.5% |
| O | 15874 | 0.2% |
| Other values (3) | 4930 | < 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 15724 | |
| / | 1115 | 6.6% |
| ? | 1 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 6714801 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2124592 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 67802024 | |
| Common | 8856235 | 11.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| r | 7607800 | |
| o | 6532692 | |
| e | 6336991 | |
| i | 6293342 | |
| c | 5457494 | 8.0% |
| t | 5250425 | 7.7% |
| a | 5072337 | 7.5% |
| A | 3811578 | 5.6% |
| N | 3310135 | 4.9% |
| m | 3253669 | 4.8% |
| Other values (23) | 14875561 |
Common
| Value | Count | Frequency (%) |
| 6714801 | ||
| - | 2124592 | 24.0% |
| , | 15724 | 0.2% |
| / | 1115 | < 0.1% |
| ( | 1 | < 0.1% |
| ? | 1 | < 0.1% |
| ) | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 76658259 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| r | 7607800 | 9.9% |
| 6714801 | 8.8% | |
| o | 6532692 | 8.5% |
| e | 6336991 | 8.3% |
| i | 6293342 | 8.2% |
| c | 5457494 | 7.1% |
| t | 5250425 | 6.8% |
| a | 5072337 | 6.6% |
| A | 3811578 | 5.0% |
| N | 3310135 | 4.3% |
| Other values (30) | 20270664 |
waterBody
Text
Missing 
| Distinct | 146 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 4496088 |
| Missing (%) | 99.6% |
| Memory size | 34.5 MiB |
Length
| Max length | 62 |
|---|---|
| Median length | 61 |
| Mean length | 26.33765902 |
| Min length | 4 |
Unique
| Unique | 67 ? |
|---|---|
| Unique (%) | 0.3% |
Sample
| 1st row | North Atlantic Ocean, Bay of Fundy |
|---|---|
| 2nd row | North Atlantic Ocean, Caribbean Sea |
| 3rd row | North Atlantic Ocean, Gulf of Maine, Englishman Bay/Mack Cove |
| 4th row | North Atlantic Ocean, Caribbean Sea |
| 5th row | North Pacific Ocean |
| Value | Count | Frequency (%) |
| ocean | 15739 | |
| north | 15146 | |
| atlantic | 14293 | |
| sea | 7188 | |
| caribbean | 6009 | 7.5% |
| of | 3765 | 4.7% |
| gulf | 3584 | 4.5% |
| maine | 2788 | 3.5% |
| bay | 2525 | 3.2% |
| pacific | 1232 | 1.5% |
| Other values (153) | 7583 |
Most occurring characters
| Value | Count | Frequency (%) |
| 60279 | ||
| a | 59587 | |
| t | 46425 | 9.0% |
| n | 42137 | 8.2% |
| e | 36547 | 7.1% |
| c | 34220 | 6.6% |
| i | 27759 | 5.4% |
| r | 23435 | 4.5% |
| o | 23395 | 4.5% |
| l | 19298 | 3.7% |
| Other values (48) | 142425 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 367426 | |
| Uppercase Letter | 76511 | 14.8% |
| Space Separator | 60279 | 11.7% |
| Other Punctuation | 10843 | 2.1% |
| Modifier Letter | 442 | 0.1% |
| Dash Punctuation | 4 | < 0.1% |
| Decimal Number | 2 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 59587 | |
| t | 46425 | |
| n | 42137 | |
| e | 36547 | |
| c | 34220 | |
| i | 27759 | |
| r | 23435 | 6.4% |
| o | 23395 | 6.4% |
| l | 19298 | 5.3% |
| h | 17411 | 4.7% |
| Other values (16) | 37212 |
Uppercase Letter
| Value | Count | Frequency (%) |
| O | 15980 | |
| N | 15149 | |
| A | 14449 | |
| S | 8325 | |
| C | 7694 | |
| G | 3958 | 5.2% |
| B | 3245 | 4.2% |
| M | 3172 | 4.1% |
| P | 1964 | 2.6% |
| I | 578 | 0.8% |
| Other values (13) | 1997 | 2.6% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 10312 | |
| / | 426 | 3.9% |
| ' | 104 | 1.0% |
| . | 1 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 1 | |
| 4 | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 60279 |
Modifier Letter
| Value | Count | Frequency (%) |
| ʻ | 442 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 4 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 443937 | |
| Common | 71570 | 13.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 59587 | |
| t | 46425 | |
| n | 42137 | 9.5% |
| e | 36547 | 8.2% |
| c | 34220 | 7.7% |
| i | 27759 | 6.3% |
| r | 23435 | 5.3% |
| o | 23395 | 5.3% |
| l | 19298 | 4.3% |
| h | 17411 | 3.9% |
| Other values (39) | 113723 |
Common
| Value | Count | Frequency (%) |
| 60279 | ||
| , | 10312 | 14.4% |
| ʻ | 442 | 0.6% |
| / | 426 | 0.6% |
| ' | 104 | 0.1% |
| - | 4 | < 0.1% |
| 3 | 1 | < 0.1% |
| 4 | 1 | < 0.1% |
| . | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 514623 | |
| Modifier Letters | 442 | 0.1% |
| None | 442 | 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 60279 | ||
| a | 59587 | |
| t | 46425 | 9.0% |
| n | 42137 | 8.2% |
| e | 36547 | 7.1% |
| c | 34220 | 6.6% |
| i | 27759 | 5.4% |
| r | 23435 | 4.6% |
| o | 23395 | 4.5% |
| l | 19298 | 3.7% |
| Other values (46) | 141541 |
Modifier Letters
| Value | Count | Frequency (%) |
| ʻ | 442 |
None
| Value | Count | Frequency (%) |
| ā | 442 |
islandGroup
Text
Missing 
| Distinct | 535 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 4403077 |
| Missing (%) | 97.5% |
| Memory size | 34.5 MiB |
Length
| Max length | 42 |
|---|---|
| Median length | 41 |
| Mean length | 14.86225396 |
| Min length | 3 |
Unique
| Unique | 124 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | Greater Antilles |
|---|---|
| 2nd row | Greater Antilles |
| 3rd row | Elizabeth Islands |
| 4th row | Channel Islands |
| 5th row | Greater Antilles |
| Value | Count | Frequency (%) |
| antilles | 32223 | 12.5% |
| greater | 32220 | 12.5% |
| islands | 23341 | 9.1% |
| is | 19916 | 7.7% |
| group | 16200 | 6.3% |
| new | 7374 | 2.9% |
| guinea | 6014 | 2.3% |
| channel | 5394 | 2.1% |
| keys | 5332 | 2.1% |
| florida | 5071 | 2.0% |
| Other values (442) | 104053 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 168595 | 10.1% |
| a | 153468 | 9.2% |
| 144554 | 8.6% | |
| s | 133119 | 8.0% |
| l | 127504 | 7.6% |
| r | 120065 | 7.2% |
| n | 113308 | 6.8% |
| t | 87870 | 5.3% |
| i | 82774 | 4.9% |
| G | 60256 | 3.6% |
| Other values (54) | 481739 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1252200 | |
| Uppercase Letter | 250128 | 14.9% |
| Space Separator | 144554 | 8.6% |
| Other Punctuation | 20643 | 1.2% |
| Open Punctuation | 2855 | 0.2% |
| Close Punctuation | 2855 | 0.2% |
| Dash Punctuation | 17 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 168595 | |
| a | 153468 | |
| s | 133119 | |
| l | 127504 | |
| r | 120065 | |
| n | 113308 | |
| t | 87870 | |
| i | 82774 | |
| u | 49516 | 4.0% |
| d | 48781 | 3.9% |
| Other values (18) | 167200 |
Uppercase Letter
| Value | Count | Frequency (%) |
| G | 60256 | |
| I | 47120 | |
| A | 36666 | |
| C | 15661 | 6.3% |
| V | 13535 | 5.4% |
| L | 11432 | 4.6% |
| N | 9696 | 3.9% |
| S | 8478 | 3.4% |
| K | 6484 | 2.6% |
| F | 6018 | 2.4% |
| Other values (16) | 34782 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 19898 | |
| ' | 710 | 3.4% |
| , | 30 | 0.1% |
| ? | 5 | < 0.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| [ | 1851 | |
| ( | 1004 |
Close Punctuation
| Value | Count | Frequency (%) |
| ] | 1851 | |
| ) | 1004 |
Space Separator
| Value | Count | Frequency (%) |
| 144554 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 17 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1502328 | |
| Common | 170924 | 10.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 168595 | |
| a | 153468 | 10.2% |
| s | 133119 | 8.9% |
| l | 127504 | 8.5% |
| r | 120065 | 8.0% |
| n | 113308 | 7.5% |
| t | 87870 | 5.8% |
| i | 82774 | 5.5% |
| G | 60256 | 4.0% |
| u | 49516 | 3.3% |
| Other values (44) | 405853 |
Common
| Value | Count | Frequency (%) |
| 144554 | ||
| . | 19898 | 11.6% |
| [ | 1851 | 1.1% |
| ] | 1851 | 1.1% |
| ( | 1004 | 0.6% |
| ) | 1004 | 0.6% |
| ' | 710 | 0.4% |
| , | 30 | < 0.1% |
| - | 17 | < 0.1% |
| ? | 5 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1664297 | |
| None | 8955 | 0.5% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 168595 | 10.1% |
| a | 153468 | 9.2% |
| 144554 | 8.7% | |
| s | 133119 | 8.0% |
| l | 127504 | 7.7% |
| r | 120065 | 7.2% |
| n | 113308 | 6.8% |
| t | 87870 | 5.3% |
| i | 82774 | 5.0% |
| G | 60256 | 3.6% |
| Other values (51) | 472784 |
None
| Value | Count | Frequency (%) |
| Î | 4724 | |
| á | 4230 | |
| ñ | 1 | < 0.1% |
island
Text
Missing 
| Distinct | 4293 |
|---|---|
| Distinct (%) | 1.1% |
| Missing | 4139168 |
| Missing (%) | 91.7% |
| Memory size | 34.5 MiB |
Length
| Max length | 48 |
|---|---|
| Median length | 43 |
| Mean length | 9.545619706 |
| Min length | 1 |
Unique
| Unique | 1311 ? |
|---|---|
| Unique (%) | 0.3% |
Sample
| 1st row | Rota |
|---|---|
| 2nd row | Hispaniola |
| 3rd row | North Island |
| 4th row | Kaua'i |
| 5th row | Hispaniola Island |
| Value | Count | Frequency (%) |
| hispaniola | 49160 | 8.5% |
| island | 45105 | 7.8% |
| cuba | 23045 | 4.0% |
| oahu | 17038 | 2.9% |
| st | 12448 | 2.2% |
| kaua'i | 12005 | 2.1% |
| new | 10482 | 1.8% |
| isla | 9883 | 1.7% |
| jamaica | 9865 | 1.7% |
| luzon | 9652 | 1.7% |
| Other values (3257) | 379339 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 572883 | |
| i | 282904 | 7.9% |
| n | 241731 | 6.7% |
| o | 215280 | 6.0% |
| 201529 | 5.6% | |
| l | 189222 | 5.3% |
| u | 174379 | 4.9% |
| e | 170799 | 4.8% |
| s | 161533 | 4.5% |
| r | 127280 | 3.5% |
| Other values (76) | 1256319 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2754132 | |
| Uppercase Letter | 562682 | 15.7% |
| Space Separator | 201529 | 5.6% |
| Other Punctuation | 42453 | 1.2% |
| Close Punctuation | 15796 | 0.4% |
| Open Punctuation | 15786 | 0.4% |
| Dash Punctuation | 1476 | < 0.1% |
| Decimal Number | 4 | < 0.1% |
| Math Symbol | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 572883 | |
| i | 282904 | |
| n | 241731 | |
| o | 215280 | 7.8% |
| l | 189222 | 6.9% |
| u | 174379 | 6.3% |
| e | 170799 | 6.2% |
| s | 161533 | 5.9% |
| r | 127280 | 4.6% |
| t | 109023 | 4.0% |
| Other values (32) | 509098 |
Uppercase Letter
| Value | Count | Frequency (%) |
| H | 73966 | |
| I | 67946 | |
| C | 61093 | 10.9% |
| S | 44330 | 7.9% |
| M | 34219 | 6.1% |
| T | 25587 | 4.5% |
| B | 25543 | 4.5% |
| G | 25041 | 4.5% |
| K | 23090 | 4.1% |
| O | 22999 | 4.1% |
| Other values (19) | 158868 |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 20682 | |
| . | 20243 | |
| , | 1360 | 3.2% |
| ? | 155 | 0.4% |
| / | 12 | < 0.1% |
| & | 1 | < 0.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| [ | 13359 | |
| ( | 2427 | 15.4% |
Close Punctuation
| Value | Count | Frequency (%) |
| ] | 13357 | |
| ) | 2439 | 15.4% |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 2 | |
| 6 | 2 |
Space Separator
| Value | Count | Frequency (%) |
| 201529 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1476 |
Math Symbol
| Value | Count | Frequency (%) |
| = | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3316814 | |
| Common | 277045 | 7.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 572883 | |
| i | 282904 | 8.5% |
| n | 241731 | 7.3% |
| o | 215280 | 6.5% |
| l | 189222 | 5.7% |
| u | 174379 | 5.3% |
| e | 170799 | 5.1% |
| s | 161533 | 4.9% |
| r | 127280 | 3.8% |
| t | 109023 | 3.3% |
| Other values (61) | 1071780 |
Common
| Value | Count | Frequency (%) |
| 201529 | ||
| ' | 20682 | 7.5% |
| . | 20243 | 7.3% |
| [ | 13359 | 4.8% |
| ] | 13357 | 4.8% |
| ) | 2439 | 0.9% |
| ( | 2427 | 0.9% |
| - | 1476 | 0.5% |
| , | 1360 | 0.5% |
| ? | 155 | 0.1% |
| Other values (5) | 18 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3587365 | |
| None | 6494 | 0.2% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 572883 | |
| i | 282904 | 7.9% |
| n | 241731 | 6.7% |
| o | 215280 | 6.0% |
| 201529 | 5.6% | |
| l | 189222 | 5.3% |
| u | 174379 | 4.9% |
| e | 170799 | 4.8% |
| s | 161533 | 4.5% |
| r | 127280 | 3.5% |
| Other values (56) | 1249825 |
None
| Value | Count | Frequency (%) |
| ç | 1813 | |
| Î | 1610 | |
| é | 899 | |
| ó | 801 | |
| á | 452 | 7.0% |
| â | 387 | 6.0% |
| ñ | 238 | 3.7% |
| ã | 150 | 2.3% |
| Ö | 58 | 0.9% |
| í | 35 | 0.5% |
| Other values (10) | 51 | 0.8% |
country
Text
| Distinct | 460 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 38684 |
| Missing (%) | 0.9% |
| Memory size | 34.5 MiB |
Length
| Max length | 51 |
|---|---|
| Median length | 50 |
| Mean length | 9.388789355 |
| Min length | 4 |
Unique
| Unique | 77 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | United States |
|---|---|
| 2nd row | Peru |
| 3rd row | Argentina |
| 4th row | Venezuela |
| 5th row | South Africa |
| Value | Count | Frequency (%) |
| united | 1351629 | |
| states | 1342475 | |
| brazil | 299921 | 4.7% |
| mexico | 290196 | 4.5% |
| colombia | 165032 | 2.6% |
| venezuela | 119690 | 1.9% |
| peru | 116267 | 1.8% |
| canada | 113187 | 1.8% |
| china | 108364 | 1.7% |
| ecuador | 89292 | 1.4% |
| Other values (309) | 2403309 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 5086492 | |
| t | 4552689 | 10.8% |
| e | 4466872 | 10.6% |
| i | 3789420 | 9.0% |
| n | 3120571 | 7.4% |
| d | 1943611 | 4.6% |
| 1922385 | 4.6% | |
| s | 1851829 | 4.4% |
| S | 1538403 | 3.7% |
| U | 1421487 | 3.4% |
| Other values (55) | 12339635 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 33687190 | |
| Uppercase Letter | 6371529 | 15.2% |
| Space Separator | 1922385 | 4.6% |
| Other Punctuation | 46937 | 0.1% |
| Dash Punctuation | 4237 | < 0.1% |
| Close Punctuation | 558 | < 0.1% |
| Open Punctuation | 558 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 5086492 | |
| t | 4552689 | |
| e | 4466872 | |
| i | 3789420 | |
| n | 3120571 | |
| d | 1943611 | 5.8% |
| s | 1851829 | 5.5% |
| o | 1417591 | 4.2% |
| r | 1215020 | 3.6% |
| l | 1213250 | 3.6% |
| Other values (20) | 5029845 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 1538403 | |
| U | 1421487 | |
| C | 581180 | 9.1% |
| B | 414386 | 6.5% |
| P | 408939 | 6.4% |
| M | 379429 | 6.0% |
| G | 256961 | 4.0% |
| R | 184136 | 2.9% |
| A | 180790 | 2.8% |
| I | 146545 | 2.3% |
| Other values (15) | 859273 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 25759 | |
| , | 18129 | |
| / | 2903 | 6.2% |
| ? | 146 | 0.3% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 482 | |
| ] | 76 | 13.6% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 482 | |
| [ | 76 | 13.6% |
Space Separator
| Value | Count | Frequency (%) |
| 1922385 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 4237 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 40058719 | |
| Common | 1974675 | 4.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 5086492 | |
| t | 4552689 | |
| e | 4466872 | |
| i | 3789420 | 9.5% |
| n | 3120571 | 7.8% |
| d | 1943611 | 4.9% |
| s | 1851829 | 4.6% |
| S | 1538403 | 3.8% |
| U | 1421487 | 3.5% |
| o | 1417591 | 3.5% |
| Other values (45) | 10869754 |
Common
| Value | Count | Frequency (%) |
| 1922385 | ||
| . | 25759 | 1.3% |
| , | 18129 | 0.9% |
| - | 4237 | 0.2% |
| / | 2903 | 0.1% |
| ) | 482 | < 0.1% |
| ( | 482 | < 0.1% |
| ? | 146 | < 0.1% |
| [ | 76 | < 0.1% |
| ] | 76 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 42027851 | |
| None | 5543 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 5086492 | |
| t | 4552689 | 10.8% |
| e | 4466872 | 10.6% |
| i | 3789420 | 9.0% |
| n | 3120571 | 7.4% |
| d | 1943611 | 4.6% |
| 1922385 | 4.6% | |
| s | 1851829 | 4.4% |
| S | 1538403 | 3.7% |
| U | 1421487 | 3.4% |
| Other values (51) | 12334092 |
None
| Value | Count | Frequency (%) |
| é | 3137 | |
| ç | 2098 | |
| á | 236 | 4.3% |
| ã | 72 | 1.3% |
countryCode
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 4515660 |
| Missing (%) | > 99.9% |
| Memory size | 34.5 MiB |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 1872 |
|---|
| Value | Count | Frequency (%) |
| 1872 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 1 | |
| 8 | 1 | |
| 7 | 1 | |
| 2 | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 4 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 1 | |
| 8 | 1 | |
| 7 | 1 | |
| 2 | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 4 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 1 | |
| 8 | 1 | |
| 7 | 1 | |
| 2 | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 1 | |
| 8 | 1 | |
| 7 | 1 | |
| 2 | 1 |
stateProvince
Text
Missing 
| Distinct | 4563 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 1002183 |
| Missing (%) | 22.2% |
| Memory size | 34.5 MiB |
Length
| Max length | 63 |
|---|---|
| Median length | 51 |
| Mean length | 9.008130405 |
| Min length | 1 |
Unique
| Unique | 1056 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Florida |
|---|---|
| 2nd row | Piura |
| 3rd row | Formosa |
| 4th row | Carabobo |
| 5th row | Manabí |
| Value | Count | Frequency (%) |
| california | 201997 | 4.4% |
| new | 105413 | 2.3% |
| florida | 88714 | 1.9% |
| virginia | 72232 | 1.6% |
| texas | 71235 | 1.5% |
| alaska | 67186 | 1.5% |
| amazonas | 60909 | 1.3% |
| hawaii | 55246 | 1.2% |
| san | 50578 | 1.1% |
| arizona | 50395 | 1.1% |
| Other values (3812) | 3798781 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 4911736 | |
| i | 2591410 | 8.2% |
| n | 2327641 | 7.4% |
| o | 2312232 | 7.3% |
| r | 2011322 | 6.4% |
| e | 1596527 | 5.0% |
| s | 1274100 | 4.0% |
| l | 1254624 | 4.0% |
| t | 1112877 | 3.5% |
| 1109208 | 3.5% | |
| Other values (134) | 11148191 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 25675252 | |
| Uppercase Letter | 4615949 | 14.6% |
| Space Separator | 1109208 | 3.5% |
| Dash Punctuation | 117663 | 0.4% |
| Other Punctuation | 86259 | 0.3% |
| Open Punctuation | 22735 | 0.1% |
| Close Punctuation | 22735 | 0.1% |
| Modifier Letter | 59 | < 0.1% |
| Decimal Number | 6 | < 0.1% |
| Modifier Symbol | 2 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 4911736 | |
| i | 2591410 | |
| n | 2327641 | |
| o | 2312232 | |
| r | 2011322 | 7.8% |
| e | 1596527 | 6.2% |
| s | 1274100 | 5.0% |
| l | 1254624 | 4.9% |
| t | 1112877 | 4.3% |
| u | 1059938 | 4.1% |
| Other values (72) | 5222845 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 714018 | |
| M | 451601 | 9.8% |
| S | 382515 | 8.3% |
| A | 361688 | 7.8% |
| N | 307507 | 6.7% |
| P | 258383 | 5.6% |
| T | 197076 | 4.3% |
| B | 180196 | 3.9% |
| V | 178962 | 3.9% |
| L | 157384 | 3.4% |
| Other values (34) | 1426619 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 64175 | |
| / | 13649 | 15.8% |
| , | 4058 | 4.7% |
| ' | 3620 | 4.2% |
| ? | 639 | 0.7% |
| & | 118 | 0.1% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 21809 | |
| ] | 925 | 4.1% |
| } | 1 | < 0.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 21809 | |
| [ | 926 | 4.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 4 | |
| 7 | 2 |
Modifier Symbol
| Value | Count | Frequency (%) |
| ¸ | 1 | |
| ´ | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 1109208 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 117663 |
Modifier Letter
| Value | Count | Frequency (%) |
| ʼ | 59 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 30291201 | |
| Common | 1358667 | 4.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 4911736 | |
| i | 2591410 | 8.6% |
| n | 2327641 | 7.7% |
| o | 2312232 | 7.6% |
| r | 2011322 | 6.6% |
| e | 1596527 | 5.3% |
| s | 1274100 | 4.2% |
| l | 1254624 | 4.1% |
| t | 1112877 | 3.7% |
| u | 1059938 | 3.5% |
| Other values (116) | 9838794 |
Common
| Value | Count | Frequency (%) |
| 1109208 | ||
| - | 117663 | 8.7% |
| . | 64175 | 4.7% |
| ( | 21809 | 1.6% |
| ) | 21809 | 1.6% |
| / | 13649 | 1.0% |
| , | 4058 | 0.3% |
| ' | 3620 | 0.3% |
| [ | 926 | 0.1% |
| ] | 925 | 0.1% |
| Other values (8) | 825 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 31262419 | |
| None | 387356 | 1.2% |
| Modifier Letters | 59 | < 0.1% |
| Latin Ext Additional | 34 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 4911736 | |
| i | 2591410 | 8.3% |
| n | 2327641 | 7.4% |
| o | 2312232 | 7.4% |
| r | 2011322 | 6.4% |
| e | 1596527 | 5.1% |
| s | 1274100 | 4.1% |
| l | 1254624 | 4.0% |
| t | 1112877 | 3.6% |
| 1109208 | 3.5% | |
| Other values (57) | 10760742 |
None
| Value | Count | Frequency (%) |
| á | 140128 | |
| í | 81452 | |
| é | 62681 | |
| ó | 43934 | 11.3% |
| ã | 21483 | 5.5% |
| ô | 12517 | 3.2% |
| ñ | 5935 | 1.5% |
| ü | 4328 | 1.1% |
| ä | 2589 | 0.7% |
| ö | 2183 | 0.6% |
| Other values (54) | 10126 | 2.6% |
Modifier Letters
| Value | Count | Frequency (%) |
| ʼ | 59 |
Latin Ext Additional
| Value | Count | Frequency (%) |
| ḍ | 10 | |
| ṭ | 8 | |
| ồ | 3 | 8.8% |
| ộ | 2 | 5.9% |
| ế | 2 | 5.9% |
| ả | 2 | 5.9% |
| ị | 2 | 5.9% |
| ẵ | 1 | 2.9% |
| ừ | 1 | 2.9% |
| ậ | 1 | 2.9% |
| Other values (2) | 2 | 5.9% |
county
Text
Missing 
| Distinct | 12276 |
|---|---|
| Distinct (%) | 1.7% |
| Missing | 3778676 |
| Missing (%) | 83.7% |
| Memory size | 34.5 MiB |
Length
| Max length | 56 |
|---|---|
| Median length | 49 |
| Mean length | 9.15335726 |
| Min length | 1 |
Unique
| Unique | 3590 ? |
|---|---|
| Unique (%) | 0.5% |
Sample
| 1st row | Parroquia |
|---|---|
| 2nd row | Duval |
| 3rd row | Boulder |
| 4th row | Cantal |
| 5th row | Arlington |
| Value | Count | Frequency (%) |
| county | 55931 | 5.4% |
| san | 32851 | 3.2% |
| prince | 19199 | 1.9% |
| honolulu | 19178 | 1.8% |
| santa | 18068 | 1.7% |
| los | 14015 | 1.4% |
| angeles | 13830 | 1.3% |
| montgomery | 13792 | 1.3% |
| george's | 13723 | 1.3% |
| maui | 12914 | 1.2% |
| Other values (9383) | 823511 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 769422 | 11.4% |
| o | 546391 | 8.1% |
| n | 533923 | 7.9% |
| e | 515856 | 7.6% |
| r | 422472 | 6.3% |
| i | 392542 | 5.8% |
| t | 309386 | 4.6% |
| u | 307150 | 4.6% |
| 300027 | 4.4% | |
| l | 289660 | 4.3% |
| Other values (101) | 2359058 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 5401031 | |
| Uppercase Letter | 1011313 | 15.0% |
| Space Separator | 300027 | 4.4% |
| Other Punctuation | 25581 | 0.4% |
| Dash Punctuation | 6153 | 0.1% |
| Close Punctuation | 782 | < 0.1% |
| Open Punctuation | 781 | < 0.1% |
| Modifier Symbol | 122 | < 0.1% |
| Decimal Number | 93 | < 0.1% |
| Nonspacing Mark | 3 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 769422 | |
| o | 546391 | |
| n | 533923 | |
| e | 515856 | |
| r | 422472 | 7.8% |
| i | 392542 | 7.3% |
| t | 309386 | 5.7% |
| u | 307150 | 5.7% |
| l | 289660 | 5.4% |
| s | 242628 | 4.5% |
| Other values (38) | 1071601 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 150693 | |
| S | 110876 | |
| M | 104694 | 10.4% |
| B | 69270 | 6.8% |
| P | 63056 | 6.2% |
| A | 60055 | 5.9% |
| H | 55148 | 5.5% |
| L | 50422 | 5.0% |
| G | 40017 | 4.0% |
| F | 34703 | 3.4% |
| Other values (28) | 272379 |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 15573 | |
| . | 5768 | 22.5% |
| / | 3392 | 13.3% |
| ? | 562 | 2.2% |
| , | 248 | 1.0% |
| & | 20 | 0.1% |
| ; | 9 | < 0.1% |
| \ | 4 | < 0.1% |
| : | 3 | < 0.1% |
| ¡ | 2 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 46 | |
| 2 | 44 | |
| 6 | 1 | 1.1% |
| 4 | 1 | 1.1% |
| 9 | 1 | 1.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 6150 | |
| – | 3 | < 0.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 717 | |
| [ | 64 | 8.2% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 717 | |
| ] | 65 | 8.3% |
Space Separator
| Value | Count | Frequency (%) |
| 300027 |
Modifier Symbol
| Value | Count | Frequency (%) |
| ´ | 122 |
Nonspacing Mark
| Value | Count | Frequency (%) |
| ́ | 3 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 6412344 | |
| Common | 333540 | 4.9% |
| Inherited | 3 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 769422 | 12.0% |
| o | 546391 | 8.5% |
| n | 533923 | 8.3% |
| e | 515856 | 8.0% |
| r | 422472 | 6.6% |
| i | 392542 | 6.1% |
| t | 309386 | 4.8% |
| u | 307150 | 4.8% |
| l | 289660 | 4.5% |
| s | 242628 | 3.8% |
| Other values (76) | 2082914 |
Common
| Value | Count | Frequency (%) |
| 300027 | ||
| ' | 15573 | 4.7% |
| - | 6150 | 1.8% |
| . | 5768 | 1.7% |
| / | 3392 | 1.0% |
| ( | 717 | 0.2% |
| ) | 717 | 0.2% |
| ? | 562 | 0.2% |
| , | 248 | 0.1% |
| ´ | 122 | < 0.1% |
| Other values (14) | 264 | 0.1% |
Inherited
| Value | Count | Frequency (%) |
| ́ | 3 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6684490 | |
| None | 61391 | 0.9% |
| Punctuation | 3 | < 0.1% |
| Diacriticals | 3 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 769422 | 11.5% |
| o | 546391 | 8.2% |
| n | 533923 | 8.0% |
| e | 515856 | 7.7% |
| r | 422472 | 6.3% |
| i | 392542 | 5.9% |
| t | 309386 | 4.6% |
| u | 307150 | 4.6% |
| 300027 | 4.5% | |
| l | 289660 | 4.3% |
| Other values (63) | 2297661 |
None
| Value | Count | Frequency (%) |
| á | 13327 | |
| é | 10282 | |
| í | 10031 | |
| ó | 8697 | |
| ã | 7050 | |
| ç | 4342 | 7.1% |
| è | 1366 | 2.2% |
| ô | 1253 | 2.0% |
| ñ | 1116 | 1.8% |
| ê | 942 | 1.5% |
| Other values (26) | 2985 | 4.9% |
Punctuation
| Value | Count | Frequency (%) |
| – | 3 |
Diacriticals
| Value | Count | Frequency (%) |
| ́ | 3 |
locality
Text
Missing 
| Distinct | 2190202 |
|---|---|
| Distinct (%) | 52.4% |
| Missing | 332028 |
| Missing (%) | 7.4% |
| Memory size | 34.5 MiB |
Length
| Max length | 239546 |
|---|---|
| Median length | 438 |
| Mean length | 47.49715737 |
| Min length | 1 |
Unique
| Unique | 1771025 ? |
|---|---|
| Unique (%) | 42.3% |
Sample
| 1st row | Gulf of Mexico |
|---|---|
| 2nd row | Dept. Piura: Ayabaca |
| 3rd row | Dep. Pilcomayo. al E a 2 Km de P. Porteño. |
| 4th row | Selva siempre verde en las quebradas al norte de Los Tanques, arriba de la Planta Eléctrica, en las cabeceras del Río San Gián, al sur de Borburata. |
| 5th row | Flat terrain near Skukuza rest camp, Kruger National Park. |
| Value | Count | Frequency (%) |
| of | 1585417 | 5.0% |
| de | 609558 | 1.9% |
| the | 376944 | 1.2% |
| km | 372123 | 1.2% |
| near | 341319 | 1.1% |
| and | 273717 | 0.9% |
| on | 273627 | 0.9% |
| in | 261704 | 0.8% |
| county | 254734 | 0.8% |
| la | 230318 | 0.7% |
| Other values (599581) | 26894148 |
Most occurring characters
| Value | Count | Frequency (%) |
| 27250345 | 13.7% | |
| a | 18162578 | 9.1% |
| e | 14022740 | 7.1% |
| o | 13213905 | 6.6% |
| n | 11023738 | 5.5% |
| i | 10220547 | 5.1% |
| r | 10113701 | 5.1% |
| t | 8788622 | 4.4% |
| l | 7107480 | 3.6% |
| s | 6902603 | 3.5% |
| Other values (414) | 71904416 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 137188370 | |
| Space Separator | 27250345 | 13.7% |
| Uppercase Letter | 20470547 | 10.3% |
| Other Punctuation | 9991899 | 5.0% |
| Decimal Number | 2289002 | 1.2% |
| Dash Punctuation | 522521 | 0.3% |
| Open Punctuation | 365371 | 0.2% |
| Close Punctuation | 362506 | 0.2% |
| Control | 218330 | 0.1% |
| Other Number | 19102 | < 0.1% |
| Other values (11) | 32682 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 18162578 | |
| e | 14022740 | |
| o | 13213905 | |
| n | 11023738 | 8.0% |
| i | 10220547 | 7.5% |
| r | 10113701 | 7.4% |
| t | 8788622 | 6.4% |
| l | 7107480 | 5.2% |
| s | 6902603 | 5.0% |
| u | 5110133 | 3.7% |
| Other values (168) | 32522323 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 2415927 | 11.8% |
| S | 2030001 | 9.9% |
| M | 1536549 | 7.5% |
| P | 1504174 | 7.3% |
| R | 1298250 | 6.3% |
| B | 1168884 | 5.7% |
| A | 1036363 | 5.1% |
| N | 1025732 | 5.0% |
| L | 921587 | 4.5% |
| T | 846871 | 4.1% |
| Other values (85) | 6686209 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 5595503 | |
| , | 3400184 | |
| : | 451617 | 4.5% |
| ; | 188440 | 1.9% |
| ' | 148212 | 1.5% |
| " | 97269 | 1.0% |
| / | 49749 | 0.5% |
| & | 41165 | 0.4% |
| # | 9097 | 0.1% |
| ? | 7631 | 0.1% |
| Other values (12) | 3032 | < 0.1% |
Other Symbol
| Value | Count | Frequency (%) |
| ° | 6828 | |
| ├ | 18 | 0.3% |
| ░ | 6 | 0.1% |
| ┬ | 6 | 0.1% |
| ¦ | 5 | 0.1% |
| │ | 3 | < 0.1% |
| ─ | 3 | < 0.1% |
| ▒ | 2 | < 0.1% |
| ☼ | 2 | < 0.1% |
| ™ | 2 | < 0.1% |
| Other values (5) | 5 | 0.1% |
Control
| Value | Count | Frequency (%) |
| 216904 | ||
| 1148 | 0.5% | |
| 71 | < 0.1% | |
| 65 | < 0.1% | |
| | 39 | < 0.1% |
| | 31 | < 0.1% |
| | 30 | < 0.1% |
| | 27 | < 0.1% |
| | 9 | < 0.1% |
| | 2 | < 0.1% |
| Other values (3) | 4 | < 0.1% |
Other Number
| Value | Count | Frequency (%) |
| ½ | 12766 | |
| ¼ | 5486 | |
| ¾ | 693 | 3.6% |
| ² | 70 | 0.4% |
| ⅓ | 52 | 0.3% |
| ⅛ | 15 | 0.1% |
| ⅔ | 6 | < 0.1% |
| ³ | 6 | < 0.1% |
| ⅜ | 4 | < 0.1% |
| ¹ | 2 | < 0.1% |
| Other values (2) | 2 | < 0.1% |
Math Symbol
| Value | Count | Frequency (%) |
| = | 7069 | |
| ± | 5350 | |
| + | 3005 | |
| > | 1166 | 6.2% |
| < | 1123 | 6.0% |
| ~ | 1026 | 5.5% |
| | | 39 | 0.2% |
| → | 12 | 0.1% |
| ∆ | 8 | < 0.1% |
| ↔ | 1 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 471051 | |
| 2 | 325524 | |
| 0 | 281269 | |
| 5 | 270344 | |
| 3 | 241514 | |
| 4 | 188715 | |
| 6 | 149071 | 6.5% |
| 8 | 126175 | 5.5% |
| 7 | 123405 | 5.4% |
| 9 | 111934 | 4.9% |
Modifier Letter
| Value | Count | Frequency (%) |
| ʻ | 442 | |
| ᵉ | 7 | 1.5% |
| ˮ | 2 | 0.4% |
| ᵍ | 1 | 0.2% |
| ᶵ | 1 | 0.2% |
| ᵈ | 1 | 0.2% |
| ˍ | 1 | 0.2% |
| ʼ | 1 | 0.2% |
| ᴸ | 1 | 0.2% |
| ᴱ | 1 | 0.2% |
Other Letter
| Value | Count | Frequency (%) |
| º | 3444 | |
| ª | 95 | 2.7% |
| 林 | 1 | < 0.1% |
| 大 | 1 | < 0.1% |
| 道 | 1 | < 0.1% |
| 郡 | 1 | < 0.1% |
| 角 | 1 | < 0.1% |
| 平 | 1 | < 0.1% |
| 太 | 1 | < 0.1% |
Nonspacing Mark
| Value | Count | Frequency (%) |
| ͤ | 9 | |
| ̄ | 6 | |
| ̈ | 4 | |
| ̋ | 3 | 10.7% |
| ̩ | 2 | 7.1% |
| ́ | 1 | 3.6% |
| ̌ | 1 | 3.6% |
| ᷉ | 1 | 3.6% |
| ̊ | 1 | 3.6% |
Format
| Value | Count | Frequency (%) |
| | 7 | |
| | 3 | |
| | 3 | |
| | 2 | 8.7% |
| | 2 | 8.7% |
| | 2 | 8.7% |
| | 2 | 8.7% |
| | 2 | 8.7% |
Modifier Symbol
| Value | Count | Frequency (%) |
| ´ | 491 | |
| ¨ | 32 | 5.9% |
| ^ | 11 | 2.0% |
| ˶ | 5 | 0.9% |
| ˚ | 3 | 0.6% |
| ˜ | 2 | 0.4% |
| ¯ | 1 | 0.2% |
Currency Symbol
| Value | Count | Frequency (%) |
| ¢ | 114 | |
| ¤ | 96 | |
| $ | 19 | 7.8% |
| £ | 14 | 5.7% |
| ¥ | 1 | 0.4% |
| € | 1 | 0.4% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 269825 | |
| [ | 94852 | 26.0% |
| „ | 402 | 0.1% |
| ‚ | 198 | 0.1% |
| { | 94 | < 0.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 522467 | |
| – | 36 | < 0.1% |
| — | 18 | < 0.1% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 267308 | |
| ] | 95098 | 26.2% |
| } | 100 | < 0.1% |
Final Punctuation
| Value | Count | Frequency (%) |
| » | 802 | |
| ” | 46 | 5.3% |
| › | 14 | 1.6% |
Initial Punctuation
| Value | Count | Frequency (%) |
| « | 792 | |
| “ | 159 | 16.7% |
| ‛ | 1 | 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 27250345 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 343 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 157662328 | |
| Common | 41048170 | 20.7% |
| Greek | 137 | < 0.1% |
| Inherited | 30 | < 0.1% |
| Han | 7 | < 0.1% |
| Cyrillic | 3 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 18162578 | 11.5% |
| e | 14022740 | 8.9% |
| o | 13213905 | 8.4% |
| n | 11023738 | 7.0% |
| i | 10220547 | 6.5% |
| r | 10113701 | 6.4% |
| t | 8788622 | 5.6% |
| l | 7107480 | 4.5% |
| s | 6902603 | 4.4% |
| u | 5110133 | 3.2% |
| Other values (250) | 52996281 |
Common
| Value | Count | Frequency (%) |
| 27250345 | ||
| . | 5595503 | 13.6% |
| , | 3400184 | 8.3% |
| - | 522467 | 1.3% |
| 1 | 471051 | 1.1% |
| : | 451617 | 1.1% |
| 2 | 325524 | 0.8% |
| 0 | 281269 | 0.7% |
| 5 | 270344 | 0.7% |
| ( | 269825 | 0.7% |
| Other values (116) | 2210041 | 5.4% |
Greek
| Value | Count | Frequency (%) |
| λ | 26 | |
| ν | 21 | |
| η | 15 | |
| ή | 13 | |
| υ | 13 | |
| Κ | 13 | |
| ρ | 7 | 5.1% |
| α | 4 | 2.9% |
| ο | 4 | 2.9% |
| ά | 3 | 2.2% |
| Other values (8) | 18 |
Inherited
| Value | Count | Frequency (%) |
| ͤ | 9 | |
| ̄ | 6 | |
| ̈ | 4 | |
| ̋ | 3 | 10.0% |
| | 2 | 6.7% |
| ̩ | 2 | 6.7% |
| ́ | 1 | 3.3% |
| ̌ | 1 | 3.3% |
| ᷉ | 1 | 3.3% |
| ̊ | 1 | 3.3% |
Han
| Value | Count | Frequency (%) |
| 林 | 1 | |
| 大 | 1 | |
| 道 | 1 | |
| 郡 | 1 | |
| 角 | 1 | |
| 平 | 1 | |
| 太 | 1 |
Cyrillic
| Value | Count | Frequency (%) |
| ҫ | 1 | |
| ӧ | 1 | |
| ӗ | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 197800565 | |
| None | 908364 | 0.5% |
| Punctuation | 1039 | < 0.1% |
| Modifier Letters | 456 | < 0.1% |
| Number Forms | 79 | < 0.1% |
| Latin Ext Additional | 46 | < 0.1% |
| Box Drawing | 33 | < 0.1% |
| Diacriticals | 27 | < 0.1% |
| Arrows | 13 | < 0.1% |
| Phonetic Ext | 11 | < 0.1% |
| Other values (12) | 42 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 27250345 | 13.8% | |
| a | 18162578 | 9.2% |
| e | 14022740 | 7.1% |
| o | 13213905 | 6.7% |
| n | 11023738 | 5.6% |
| i | 10220547 | 5.2% |
| r | 10113701 | 5.1% |
| t | 8788622 | 4.4% |
| l | 7107480 | 3.6% |
| s | 6902603 | 3.5% |
| Other values (90) | 70994306 |
None
| Value | Count | Frequency (%) |
| í | 211195 | |
| á | 166897 | |
| é | 110126 | |
| ó | 93913 | |
| ñ | 46099 | 5.1% |
| ã | 36696 | 4.0% |
| ú | 26716 | 2.9% |
| ç | 22907 | 2.5% |
| ü | 18482 | 2.0% |
| ä | 16954 | 1.9% |
| Other values (221) | 158379 |
Modifier Letters
| Value | Count | Frequency (%) |
| ʻ | 442 | |
| ˶ | 5 | 1.1% |
| ˚ | 3 | 0.7% |
| ˮ | 2 | 0.4% |
| ˜ | 2 | 0.4% |
| ˍ | 1 | 0.2% |
| ʼ | 1 | 0.2% |
Punctuation
| Value | Count | Frequency (%) |
| „ | 402 | |
| ‚ | 198 | |
| “ | 159 | 15.3% |
| … | 149 | 14.3% |
| ” | 46 | 4.4% |
| – | 36 | 3.5% |
| — | 18 | 1.7% |
| › | 14 | 1.3% |
| | 3 | 0.3% |
| | 3 | 0.3% |
| Other values (7) | 11 | 1.1% |
Number Forms
| Value | Count | Frequency (%) |
| ⅓ | 52 | |
| ⅛ | 15 | 19.0% |
| ⅔ | 6 | 7.6% |
| ⅜ | 4 | 5.1% |
| ⅕ | 1 | 1.3% |
| ⅝ | 1 | 1.3% |
Box Drawing
| Value | Count | Frequency (%) |
| ├ | 18 | |
| ┬ | 6 | 18.2% |
| │ | 3 | 9.1% |
| ─ | 3 | 9.1% |
| ┼ | 1 | 3.0% |
| ║ | 1 | 3.0% |
| ╢ | 1 | 3.0% |
Arrows
| Value | Count | Frequency (%) |
| → | 12 | |
| ↔ | 1 | 7.7% |
Latin Ext Additional
| Value | Count | Frequency (%) |
| ṅ | 9 | |
| ắ | 7 | |
| ị | 4 | |
| ḗ | 4 | |
| ẽ | 3 | 6.5% |
| ḿ | 3 | 6.5% |
| ṁ | 2 | 4.3% |
| ạ | 2 | 4.3% |
| ẑ | 2 | 4.3% |
| ộ | 2 | 4.3% |
| Other values (7) | 8 |
Diacriticals
| Value | Count | Frequency (%) |
| ͤ | 9 | |
| ̄ | 6 | |
| ̈ | 4 | |
| ̋ | 3 | 11.1% |
| ̩ | 2 | 7.4% |
| ́ | 1 | 3.7% |
| ̌ | 1 | 3.7% |
| ̊ | 1 | 3.7% |
Math Operators
| Value | Count | Frequency (%) |
| ∆ | 8 |
Phonetic Ext
| Value | Count | Frequency (%) |
| ᵉ | 7 | |
| ᵍ | 1 | 9.1% |
| ᵈ | 1 | 9.1% |
| ᴸ | 1 | 9.1% |
| ᴱ | 1 | 9.1% |
Block Elements
| Value | Count | Frequency (%) |
| ░ | 6 | |
| ▒ | 2 | 25.0% |
IPA Ext
| Value | Count | Frequency (%) |
| ɶ | 3 | |
| ɐ | 1 | 20.0% |
| ʈ | 1 | 20.0% |
Greek Ext
| Value | Count | Frequency (%) |
| ῡ | 2 | |
| ᾰ | 1 |
Misc Symbols
| Value | Count | Frequency (%) |
| ☼ | 2 |
Letterlike Symbols
| Value | Count | Frequency (%) |
| ™ | 2 |
Cyrillic
| Value | Count | Frequency (%) |
| ҫ | 1 | |
| ӧ | 1 | |
| ӗ | 1 |
CJK
| Value | Count | Frequency (%) |
| 林 | 1 | |
| 大 | 1 | |
| 道 | 1 | |
| 郡 | 1 | |
| 角 | 1 | |
| 平 | 1 | |
| 太 | 1 |
Diacriticals Sup
| Value | Count | Frequency (%) |
| ᷉ | 1 |
Phonetic Ext Sup
| Value | Count | Frequency (%) |
| ᶵ | 1 |
Currency Symbols
| Value | Count | Frequency (%) |
| € | 1 |
Misc Technical
| Value | Count | Frequency (%) |
| ⌐ | 1 |
verbatimLocality
Text
Missing 
| Distinct | 5 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 4515656 |
| Missing (%) | > 99.9% |
| Memory size | 34.5 MiB |
Length
| Max length | 12 |
|---|---|
| Median length | 10 |
| Mean length | 10.4 |
| Min length | 10 |
Unique
| Unique | 5 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 83 41'11"W |
|---|---|
| 2nd row | 78 50' 50" W |
| 3rd row | 82 44'01"W |
| 4th row | 82 43'07"W |
| 5th row | 83 37'47"W |
| Value | Count | Frequency (%) |
| 83 | 2 | |
| 50 | 2 | |
| 82 | 2 | |
| 41'11"w | 1 | |
| 78 | 1 | |
| w | 1 | |
| 44'01"w | 1 | |
| 43'07"w | 1 | |
| 37'47"w | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 7 | ||
| 8 | 5 | |
| 4 | 5 | |
| ' | 5 | |
| " | 5 | |
| W | 5 | |
| 3 | 4 | |
| 1 | 4 | |
| 7 | 4 | |
| 0 | 4 | |
| Other values (2) | 4 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 30 | |
| Other Punctuation | 10 | 19.2% |
| Space Separator | 7 | 13.5% |
| Uppercase Letter | 5 | 9.6% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 8 | 5 | |
| 4 | 5 | |
| 3 | 4 | |
| 1 | 4 | |
| 7 | 4 | |
| 0 | 4 | |
| 5 | 2 | 6.7% |
| 2 | 2 | 6.7% |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 5 | |
| " | 5 |
Space Separator
| Value | Count | Frequency (%) |
| 7 |
Uppercase Letter
| Value | Count | Frequency (%) |
| W | 5 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 47 | |
| Latin | 5 | 9.6% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 7 | ||
| 8 | 5 | |
| 4 | 5 | |
| ' | 5 | |
| " | 5 | |
| 3 | 4 | |
| 1 | 4 | |
| 7 | 4 | |
| 0 | 4 | |
| 5 | 2 | 4.3% |
Latin
| Value | Count | Frequency (%) |
| W | 5 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 52 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 7 | ||
| 8 | 5 | |
| 4 | 5 | |
| ' | 5 | |
| " | 5 | |
| W | 5 | |
| 3 | 4 | |
| 1 | 4 | |
| 7 | 4 | |
| 0 | 4 | |
| Other values (2) | 4 |
Missing 
| Distinct | 4970 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 2860984 |
| Missing (%) | 63.4% |
| Memory size | 34.5 MiB |
Length
| Max length | 23 |
|---|---|
| Median length | 9 |
| Mean length | 5.339273465 |
| Min length | 3 |
Unique
| Unique | 773 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 2742.0 |
|---|---|
| 2nd row | 750.0 |
| 3rd row | 50.0 |
| 4th row | 0.0 |
| 5th row | 17.0 |
| Value | Count | Frequency (%) |
| 100.0 | 34114 | 2.1% |
| 1000.0 | 33132 | 2.0% |
| 200.0 | 29727 | 1.8% |
| 300.0 | 26699 | 1.6% |
| 500.0 | 26609 | 1.6% |
| 1500.0 | 25888 | 1.6% |
| 800.0 | 25536 | 1.5% |
| 400.0 | 23967 | 1.4% |
| 900.0 | 23544 | 1.4% |
| 1200.0 | 23149 | 1.4% |
| Other values (4932) | 1382314 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 3504011 | |
| . | 1654676 | |
| 1 | 806932 | 9.1% |
| 2 | 627849 | 7.1% |
| 5 | 506466 | 5.7% |
| 3 | 406488 | 4.6% |
| 4 | 318940 | 3.6% |
| 6 | 271625 | 3.1% |
| 8 | 261398 | 3.0% |
| 7 | 254146 | 2.9% |
| Other values (17) | 222242 | 2.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 7179762 | |
| Other Punctuation | 1654676 | 18.7% |
| Dash Punctuation | 312 | < 0.1% |
| Lowercase Letter | 18 | < 0.1% |
| Uppercase Letter | 3 | < 0.1% |
| Space Separator | 2 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 5 | |
| s | 3 | |
| n | 2 | 11.1% |
| g | 1 | 5.6% |
| r | 1 | 5.6% |
| i | 1 | 5.6% |
| u | 1 | 5.6% |
| t | 1 | 5.6% |
| c | 1 | 5.6% |
| o | 1 | 5.6% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 3504011 | |
| 1 | 806932 | 11.2% |
| 2 | 627849 | 8.7% |
| 5 | 506466 | 7.1% |
| 3 | 406488 | 5.7% |
| 4 | 318940 | 4.4% |
| 6 | 271625 | 3.8% |
| 8 | 261398 | 3.6% |
| 7 | 254146 | 3.5% |
| 9 | 221907 | 3.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| D | 1 | |
| M | 1 | |
| S | 1 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1654676 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 312 |
Space Separator
| Value | Count | Frequency (%) |
| 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 8834752 | |
| Latin | 21 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 5 | |
| s | 3 | |
| n | 2 | 9.5% |
| D | 1 | 4.8% |
| g | 1 | 4.8% |
| r | 1 | 4.8% |
| M | 1 | 4.8% |
| i | 1 | 4.8% |
| u | 1 | 4.8% |
| t | 1 | 4.8% |
| Other values (4) | 4 |
Common
| Value | Count | Frequency (%) |
| 0 | 3504011 | |
| . | 1654676 | |
| 1 | 806932 | 9.1% |
| 2 | 627849 | 7.1% |
| 5 | 506466 | 5.7% |
| 3 | 406488 | 4.6% |
| 4 | 318940 | 3.6% |
| 6 | 271625 | 3.1% |
| 8 | 261398 | 3.0% |
| 7 | 254146 | 2.9% |
| Other values (3) | 222221 | 2.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8834773 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 3504011 | |
| . | 1654676 | |
| 1 | 806932 | 9.1% |
| 2 | 627849 | 7.1% |
| 5 | 506466 | 5.7% |
| 3 | 406488 | 4.6% |
| 4 | 318940 | 3.6% |
| 6 | 271625 | 3.1% |
| 8 | 261398 | 3.0% |
| 7 | 254146 | 2.9% |
| Other values (17) | 222242 | 2.5% |
Missing 
| Distinct | 2856 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 4017410 |
| Missing (%) | 89.0% |
| Memory size | 34.5 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 6 |
| Mean length | 5.395878784 |
| Min length | 3 |
Unique
| Unique | 706 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | 450.0 |
|---|---|
| 2nd row | 850.0 |
| 3rd row | 792.0 |
| 4th row | 1680.0 |
| 5th row | 1981.0 |
| Value | Count | Frequency (%) |
| 1000.0 | 11909 | 2.4% |
| 600.0 | 10801 | 2.2% |
| 500.0 | 10617 | 2.1% |
| 1500.0 | 10106 | 2.0% |
| 900.0 | 9847 | 2.0% |
| 1200.0 | 9380 | 1.9% |
| 100.0 | 9062 | 1.8% |
| 400.0 | 8956 | 1.8% |
| 300.0 | 8472 | 1.7% |
| 2000.0 | 8255 | 1.7% |
| Other values (2842) | 400846 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 1146584 | |
| . | 498251 | |
| 1 | 224425 | 8.3% |
| 2 | 185816 | 6.9% |
| 5 | 156368 | 5.8% |
| 3 | 121709 | 4.5% |
| 4 | 85324 | 3.2% |
| 6 | 75371 | 2.8% |
| 8 | 70381 | 2.6% |
| 7 | 65922 | 2.5% |
| Other values (2) | 58351 | 2.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2190247 | |
| Other Punctuation | 498251 | 18.5% |
| Dash Punctuation | 4 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 1146584 | |
| 1 | 224425 | 10.2% |
| 2 | 185816 | 8.5% |
| 5 | 156368 | 7.1% |
| 3 | 121709 | 5.6% |
| 4 | 85324 | 3.9% |
| 6 | 75371 | 3.4% |
| 8 | 70381 | 3.2% |
| 7 | 65922 | 3.0% |
| 9 | 58347 | 2.7% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 498251 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 4 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2688502 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 1146584 | |
| . | 498251 | |
| 1 | 224425 | 8.3% |
| 2 | 185816 | 6.9% |
| 5 | 156368 | 5.8% |
| 3 | 121709 | 4.5% |
| 4 | 85324 | 3.2% |
| 6 | 75371 | 2.8% |
| 8 | 70381 | 2.6% |
| 7 | 65922 | 2.5% |
| Other values (2) | 58351 | 2.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2688502 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 1146584 | |
| . | 498251 | |
| 1 | 224425 | 8.3% |
| 2 | 185816 | 6.9% |
| 5 | 156368 | 5.8% |
| 3 | 121709 | 4.5% |
| 4 | 85324 | 3.2% |
| 6 | 75371 | 2.8% |
| 8 | 70381 | 2.6% |
| 7 | 65922 | 2.5% |
| Other values (2) | 58351 | 2.2% |
Missing 
| Distinct | 172 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 4475632 |
| Missing (%) | 99.1% |
| Memory size | 34.5 MiB |
Length
| Max length | 6 |
|---|---|
| Median length | 3 |
| Mean length | 3.485947688 |
| Min length | 3 |
Unique
| Unique | 61 ? |
|---|---|
| Unique (%) | 0.2% |
Sample
| 1st row | 3.0 |
|---|---|
| 2nd row | 21.0 |
| 3rd row | 3.0 |
| 4th row | 9.0 |
| 5th row | 3.0 |
| Value | Count | Frequency (%) |
| 3.0 | 7621 | |
| 9.0 | 6771 | |
| 15.0 | 6023 | |
| 21.0 | 4195 | |
| 0.0 | 2654 | 6.6% |
| 37.0 | 2182 | 5.5% |
| 27.0 | 1874 | 4.7% |
| 2.0 | 1319 | 3.3% |
| 12.0 | 1049 | 2.6% |
| 1.0 | 878 | 2.2% |
| Other values (161) | 5463 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 43497 | |
| . | 40029 | |
| 1 | 14301 | 10.2% |
| 3 | 10509 | 7.5% |
| 2 | 9006 | 6.5% |
| 5 | 7321 | 5.2% |
| 9 | 7125 | 5.1% |
| 7 | 4678 | 3.4% |
| 6 | 1307 | 0.9% |
| 4 | 929 | 0.7% |
| Other values (2) | 837 | 0.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 99509 | |
| Other Punctuation | 40029 | |
| Dash Punctuation | 1 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 43497 | |
| 1 | 14301 | 14.4% |
| 3 | 10509 | 10.6% |
| 2 | 9006 | 9.1% |
| 5 | 7321 | 7.4% |
| 9 | 7125 | 7.2% |
| 7 | 4678 | 4.7% |
| 6 | 1307 | 1.3% |
| 4 | 929 | 0.9% |
| 8 | 836 | 0.8% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 40029 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 139539 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 43497 | |
| . | 40029 | |
| 1 | 14301 | 10.2% |
| 3 | 10509 | 7.5% |
| 2 | 9006 | 6.5% |
| 5 | 7321 | 5.2% |
| 9 | 7125 | 5.1% |
| 7 | 4678 | 3.4% |
| 6 | 1307 | 0.9% |
| 4 | 929 | 0.7% |
| Other values (2) | 837 | 0.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 139539 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 43497 | |
| . | 40029 | |
| 1 | 14301 | 10.2% |
| 3 | 10509 | 7.5% |
| 2 | 9006 | 6.5% |
| 5 | 7321 | 5.2% |
| 9 | 7125 | 5.1% |
| 7 | 4678 | 3.4% |
| 6 | 1307 | 0.9% |
| 4 | 929 | 0.7% |
| Other values (2) | 837 | 0.6% |
Missing 
| Distinct | 80 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 4478965 |
| Missing (%) | 99.2% |
| Memory size | 34.5 MiB |
Length
| Max length | 6 |
|---|---|
| Median length | 4 |
| Mean length | 3.677785045 |
| Min length | 3 |
Unique
| Unique | 19 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | 3.0 |
|---|---|
| 2nd row | 27.0 |
| 3rd row | 3.0 |
| 4th row | 15.0 |
| 5th row | 9.0 |
| Value | Count | Frequency (%) |
| 9.0 | 5795 | |
| 15.0 | 5742 | |
| 21.0 | 5321 | |
| 27.0 | 4190 | |
| 3.0 | 3061 | |
| 49.0 | 1873 | 5.1% |
| 37.0 | 1753 | 4.8% |
| 14.0 | 1250 | 3.4% |
| 11.0 | 1072 | 2.9% |
| 5.0 | 1049 | 2.9% |
| Other values (70) | 5590 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 37406 | |
| . | 36696 | |
| 1 | 16747 | |
| 2 | 10958 | 8.1% |
| 9 | 7822 | 5.8% |
| 5 | 7226 | 5.4% |
| 7 | 6969 | 5.2% |
| 3 | 5009 | 3.7% |
| 4 | 3586 | 2.7% |
| 6 | 1850 | 1.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 98264 | |
| Other Punctuation | 36696 | 27.2% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 37406 | |
| 1 | 16747 | |
| 2 | 10958 | 11.2% |
| 9 | 7822 | 8.0% |
| 5 | 7226 | 7.4% |
| 7 | 6969 | 7.1% |
| 3 | 5009 | 5.1% |
| 4 | 3586 | 3.6% |
| 6 | 1850 | 1.9% |
| 8 | 691 | 0.7% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 36696 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 134960 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 37406 | |
| . | 36696 | |
| 1 | 16747 | |
| 2 | 10958 | 8.1% |
| 9 | 7822 | 5.8% |
| 5 | 7226 | 5.4% |
| 7 | 6969 | 5.2% |
| 3 | 5009 | 3.7% |
| 4 | 3586 | 2.7% |
| 6 | 1850 | 1.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 134960 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 37406 | |
| . | 36696 | |
| 1 | 16747 | |
| 2 | 10958 | 8.1% |
| 9 | 7822 | 5.8% |
| 5 | 7226 | 5.4% |
| 7 | 6969 | 5.2% |
| 3 | 5009 | 3.7% |
| 4 | 3586 | 2.7% |
| 6 | 1850 | 1.4% |
verbatimDepth
Text
Missing 
| Distinct | 18 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 4494022 |
| Missing (%) | 99.5% |
| Memory size | 34.5 MiB |
Length
| Max length | 37 |
|---|---|
| Median length | 3 |
| Mean length | 3.033874024 |
| Min length | 2 |
Unique
| Unique | 7 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | ca. |
|---|---|
| 2nd row | ca. |
| 3rd row | ca. |
| 4th row | ca. |
| 5th row | ca. |
| Value | Count | Frequency (%) |
| ca | 21589 | |
| intertidal | 57 | 0.3% |
| mlw | 15 | 0.1% |
| infralittoral | 12 | 0.1% |
| below | 6 | < 0.1% |
| above | 5 | < 0.1% |
| low | 5 | < 0.1% |
| tide | 5 | < 0.1% |
| feet | 2 | < 0.1% |
| cay | 2 | < 0.1% |
| Other values (20) | 22 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 21682 | |
| c | 21591 | |
| . | 21426 | |
| t | 151 | 0.2% |
| l | 115 | 0.2% |
| i | 97 | 0.1% |
| e | 90 | 0.1% |
| r | 85 | 0.1% |
| 81 | 0.1% | |
| n | 70 | 0.1% |
| Other values (24) | 262 | 0.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 44081 | |
| Other Punctuation | 21426 | |
| Space Separator | 81 | 0.1% |
| Uppercase Letter | 49 | 0.1% |
| Decimal Number | 6 | < 0.1% |
| Open Punctuation | 2 | < 0.1% |
| Close Punctuation | 2 | < 0.1% |
| Dash Punctuation | 2 | < 0.1% |
| Math Symbol | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 21682 | |
| c | 21591 | |
| t | 151 | 0.3% |
| l | 115 | 0.3% |
| i | 97 | 0.2% |
| e | 90 | 0.2% |
| r | 85 | 0.2% |
| n | 70 | 0.2% |
| d | 65 | 0.1% |
| o | 37 | 0.1% |
| Other values (11) | 98 | 0.2% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 2 | |
| 0 | 2 | |
| 4 | 1 | |
| 8 | 1 |
Uppercase Letter
| Value | Count | Frequency (%) |
| I | 47 | |
| A | 1 | 2.0% |
| D | 1 | 2.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 21426 |
Space Separator
| Value | Count | Frequency (%) |
| 81 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 2 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 2 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2 |
Math Symbol
| Value | Count | Frequency (%) |
| < | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 44130 | |
| Common | 21520 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 21682 | |
| c | 21591 | |
| t | 151 | 0.3% |
| l | 115 | 0.3% |
| i | 97 | 0.2% |
| e | 90 | 0.2% |
| r | 85 | 0.2% |
| n | 70 | 0.2% |
| d | 65 | 0.1% |
| I | 47 | 0.1% |
| Other values (14) | 137 | 0.3% |
Common
| Value | Count | Frequency (%) |
| . | 21426 | |
| 81 | 0.4% | |
| ( | 2 | < 0.1% |
| ) | 2 | < 0.1% |
| 1 | 2 | < 0.1% |
| 0 | 2 | < 0.1% |
| - | 2 | < 0.1% |
| < | 1 | < 0.1% |
| 4 | 1 | < 0.1% |
| 8 | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 65650 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 21682 | |
| c | 21591 | |
| . | 21426 | |
| t | 151 | 0.2% |
| l | 115 | 0.2% |
| i | 97 | 0.1% |
| e | 90 | 0.1% |
| r | 85 | 0.1% |
| 81 | 0.1% | |
| n | 70 | 0.1% |
| Other values (24) | 262 | 0.4% |
decimalLatitude
Text
Missing 
| Distinct | 65348 |
|---|---|
| Distinct (%) | 9.8% |
| Missing | 3845453 |
| Missing (%) | 85.2% |
| Memory size | 34.5 MiB |
Length
| Max length | 55 |
|---|---|
| Median length | 30 |
| Mean length | 5.797518084 |
| Min length | 3 |
Unique
| Unique | 29342 ? |
|---|---|
| Unique (%) | 4.4% |
Sample
| 1st row | 26.2786 |
|---|---|
| 2nd row | -35.57 |
| 3rd row | 18.6519 |
| 4th row | -36.68 |
| 5th row | 5.86667 |
| Value | Count | Frequency (%) |
| 38.895 | 3805 | 0.6% |
| 38.9694 | 3780 | 0.6% |
| 3.61 | 1737 | 0.3% |
| 0.83 | 1704 | 0.3% |
| 9.405 | 1696 | 0.3% |
| 5.16667 | 1640 | 0.2% |
| 0.35 | 1588 | 0.2% |
| 38.8664 | 1571 | 0.2% |
| 5.2 | 1487 | 0.2% |
| 12.83 | 1407 | 0.2% |
| Other values (59827) | 649801 |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 670206 | |
| 3 | 456711 | |
| 1 | 360359 | |
| 8 | 332357 | |
| 2 | 329986 | |
| 5 | 313169 | |
| 6 | 300506 | |
| 7 | 272005 | |
| 4 | 239301 | 6.2% |
| 9 | 228861 | 5.9% |
| Other values (29) | 382082 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3026714 | |
| Other Punctuation | 670211 | 17.2% |
| Dash Punctuation | 188539 | 4.9% |
| Lowercase Letter | 60 | < 0.1% |
| Uppercase Letter | 11 | < 0.1% |
| Space Separator | 8 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 11 | |
| i | 8 | |
| n | 6 | |
| r | 5 | |
| o | 5 | |
| e | 4 | 6.7% |
| t | 4 | 6.7% |
| c | 4 | 6.7% |
| s | 4 | 6.7% |
| l | 2 | 3.3% |
| Other values (6) | 7 |
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 456711 | |
| 1 | 360359 | |
| 8 | 332357 | |
| 2 | 329986 | |
| 5 | 313169 | |
| 6 | 300506 | |
| 7 | 272005 | |
| 4 | 239301 | |
| 9 | 228861 | |
| 0 | 193459 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 2 | |
| S | 2 | |
| C | 1 | |
| U | 1 | |
| N | 1 | |
| J | 1 | |
| I | 1 | |
| T | 1 | |
| F | 1 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 670206 | |
| , | 5 | < 0.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 188539 |
Space Separator
| Value | Count | Frequency (%) |
| 8 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3885472 | |
| Latin | 71 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 11 | |
| i | 8 | |
| n | 6 | 8.5% |
| r | 5 | 7.0% |
| o | 5 | 7.0% |
| e | 4 | 5.6% |
| t | 4 | 5.6% |
| c | 4 | 5.6% |
| s | 4 | 5.6% |
| l | 2 | 2.8% |
| Other values (15) | 18 |
Common
| Value | Count | Frequency (%) |
| . | 670206 | |
| 3 | 456711 | |
| 1 | 360359 | |
| 8 | 332357 | |
| 2 | 329986 | |
| 5 | 313169 | |
| 6 | 300506 | |
| 7 | 272005 | |
| 4 | 239301 | 6.2% |
| 9 | 228861 | 5.9% |
| Other values (4) | 382011 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3885543 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 670206 | |
| 3 | 456711 | |
| 1 | 360359 | |
| 8 | 332357 | |
| 2 | 329986 | |
| 5 | 313169 | |
| 6 | 300506 | |
| 7 | 272005 | |
| 4 | 239301 | 6.2% |
| 9 | 228861 | 5.9% |
| Other values (29) | 382082 |
decimalLongitude
Text
Missing 
| Distinct | 67040 |
|---|---|
| Distinct (%) | 10.0% |
| Missing | 3845454 |
| Missing (%) | 85.2% |
| Memory size | 34.5 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 10 |
| Mean length | 6.786640545 |
| Min length | 3 |
Unique
| Unique | 27805 ? |
|---|---|
| Unique (%) | 4.1% |
Sample
| 1st row | -83.7803 |
|---|---|
| 2nd row | 137.32 |
| 3rd row | -71.5572 |
| 4th row | -72.97 |
| 5th row | -60.5667 |
| Value | Count | Frequency (%) |
| 77.0367 | 3741 | 0.6% |
| 77.1767 | 3712 | 0.6% |
| 59.4833 | 2407 | 0.4% |
| 53.2 | 1746 | 0.3% |
| 79.8635 | 1622 | 0.2% |
| 52.33 | 1591 | 0.2% |
| 77.7064 | 1461 | 0.2% |
| 59.48 | 1420 | 0.2% |
| 70.95 | 1409 | 0.2% |
| 88.08 | 1385 | 0.2% |
| Other values (62110) | 649714 |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 670205 | |
| - | 569111 | |
| 7 | 494921 | |
| 1 | 399449 | |
| 6 | 383274 | |
| 5 | 381184 | |
| 8 | 322689 | |
| 3 | 322209 | |
| 9 | 271258 | |
| 2 | 262807 | 5.8% |
| Other values (18) | 471347 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3309113 | |
| Other Punctuation | 670205 | 14.7% |
| Dash Punctuation | 569111 | 12.5% |
| Lowercase Letter | 20 | < 0.1% |
| Uppercase Letter | 4 | < 0.1% |
| Space Separator | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 3 | |
| a | 3 | |
| i | 3 | |
| o | 2 | |
| c | 2 | |
| s | 1 | 5.0% |
| p | 1 | 5.0% |
| l | 1 | 5.0% |
| t | 1 | 5.0% |
| h | 1 | 5.0% |
| Other values (2) | 2 |
Decimal Number
| Value | Count | Frequency (%) |
| 7 | 494921 | |
| 1 | 399449 | |
| 6 | 383274 | |
| 5 | 381184 | |
| 8 | 322689 | |
| 3 | 322209 | |
| 9 | 271258 | |
| 2 | 262807 | |
| 0 | 236187 | |
| 4 | 235135 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 2 | |
| T | 1 | |
| N | 1 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 670205 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 569111 |
Space Separator
| Value | Count | Frequency (%) |
| 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 4548430 | |
| Latin | 24 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| r | 3 | |
| a | 3 | |
| i | 3 | |
| A | 2 | 8.3% |
| o | 2 | 8.3% |
| c | 2 | 8.3% |
| s | 1 | 4.2% |
| T | 1 | 4.2% |
| p | 1 | 4.2% |
| l | 1 | 4.2% |
| Other values (5) | 5 |
Common
| Value | Count | Frequency (%) |
| . | 670205 | |
| - | 569111 | |
| 7 | 494921 | |
| 1 | 399449 | |
| 6 | 383274 | |
| 5 | 381184 | |
| 8 | 322689 | |
| 3 | 322209 | |
| 9 | 271258 | |
| 2 | 262807 | 5.8% |
| Other values (3) | 471323 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4548454 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 670205 | |
| - | 569111 | |
| 7 | 494921 | |
| 1 | 399449 | |
| 6 | 383274 | |
| 5 | 381184 | |
| 8 | 322689 | |
| 3 | 322209 | |
| 9 | 271258 | |
| 2 | 262807 | 5.8% |
| Other values (18) | 471347 |
geodeticDatum
Text
Missing 
| Distinct | 13 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 4485859 |
| Missing (%) | 99.3% |
| Memory size | 34.5 MiB |
Length
| Max length | 18 |
|---|---|
| Median length | 18 |
| Mean length | 14.73068922 |
| Min length | 4 |
Unique
| Unique | 5 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | WGS84 |
|---|---|
| 2nd row | WGS 84 (EPSG:4326) |
| 3rd row | WGS84 |
| 4th row | WGS 84 (EPSG:4326) |
| 5th row | WGS 84 (EPSG:4326) |
| Value | Count | Frequency (%) |
| wgs | 20853 | |
| 84 | 20853 | |
| epsg:4326 | 20842 | |
| wgs84 | 6659 | 9.0% |
| not | 1679 | 2.3% |
| recorded | 1679 | 2.3% |
| nad83 | 385 | 0.5% |
| epsg:4269 | 385 | 0.5% |
| epsg:4267 | 212 | 0.3% |
| nad27 | 212 | 0.3% |
| Other values (8) | 24 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 4 | 48970 | |
| G | 48969 | |
| S | 48960 | |
| 43981 | ||
| 8 | 27912 | 6.4% |
| W | 27512 | 6.3% |
| 2 | 21661 | 4.9% |
| ( | 21448 | 4.9% |
| E | 21448 | 4.9% |
| P | 21448 | 4.9% |
| Other values (21) | 106695 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 173319 | |
| Decimal Number | 142056 | |
| Space Separator | 43981 | 10.0% |
| Open Punctuation | 21448 | 4.9% |
| Other Punctuation | 21448 | 4.9% |
| Close Punctuation | 21448 | 4.9% |
| Lowercase Letter | 15299 | 3.5% |
| Dash Punctuation | 5 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 4 | 48970 | |
| 8 | 27912 | |
| 2 | 21661 | |
| 6 | 21439 | |
| 3 | 21239 | |
| 7 | 425 | 0.3% |
| 9 | 397 | 0.3% |
| 1 | 9 | < 0.1% |
| 0 | 3 | < 0.1% |
| 5 | 1 | < 0.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| G | 48969 | |
| S | 48960 | |
| W | 27512 | |
| E | 21448 | |
| P | 21448 | |
| N | 2183 | 1.3% |
| R | 1585 | 0.9% |
| A | 607 | 0.4% |
| D | 607 | 0.4% |
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 3358 | |
| d | 3358 | |
| o | 3358 | |
| r | 1773 | |
| t | 1679 | |
| c | 1679 | |
| n | 94 | 0.6% |
Space Separator
| Value | Count | Frequency (%) |
| 43981 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 21448 |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 21448 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 21448 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 5 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 250386 | |
| Latin | 188618 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| G | 48969 | |
| S | 48960 | |
| W | 27512 | |
| E | 21448 | |
| P | 21448 | |
| e | 3358 | 1.8% |
| d | 3358 | 1.8% |
| o | 3358 | 1.8% |
| N | 2183 | 1.2% |
| r | 1773 | 0.9% |
| Other values (6) | 6251 | 3.3% |
Common
| Value | Count | Frequency (%) |
| 4 | 48970 | |
| 43981 | ||
| 8 | 27912 | |
| 2 | 21661 | |
| ( | 21448 | |
| : | 21448 | |
| ) | 21448 | |
| 6 | 21439 | |
| 3 | 21239 | |
| 7 | 425 | 0.2% |
| Other values (5) | 415 | 0.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 439004 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 4 | 48970 | |
| G | 48969 | |
| S | 48960 | |
| 43981 | ||
| 8 | 27912 | 6.4% |
| W | 27512 | 6.3% |
| 2 | 21661 | 4.9% |
| ( | 21448 | 4.9% |
| E | 21448 | 4.9% |
| P | 21448 | 4.9% |
| Other values (21) | 106695 |
coordinateUncertaintyInMeters
Text
Missing 
| Distinct | 21 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 4509192 |
| Missing (%) | 99.9% |
| Memory size | 34.5 MiB |
Length
| Max length | 5 |
|---|---|
| Median length | 4 |
| Mean length | 3.866439944 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 16000 |
|---|---|
| 2nd row | 1500 |
| 3rd row | 250 |
| 4th row | 500 |
| 5th row | 1500 |
| Value | Count | Frequency (%) |
| 16000 | 1334 | |
| 1000 | 1304 | |
| 500 | 1065 | |
| 3000 | 624 | |
| 250 | 622 | |
| 750 | 305 | 4.7% |
| 5000 | 282 | 4.4% |
| 1500 | 267 | 4.1% |
| 2000 | 202 | 3.1% |
| 3500 | 148 | 2.3% |
| Other values (11) | 316 | 4.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 15790 | |
| 1 | 3052 | 12.2% |
| 5 | 2771 | 11.1% |
| 6 | 1370 | 5.5% |
| 2 | 879 | 3.5% |
| 3 | 785 | 3.1% |
| 7 | 305 | 1.2% |
| 8 | 50 | 0.2% |
| 4 | 10 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 25012 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 15790 | |
| 1 | 3052 | 12.2% |
| 5 | 2771 | 11.1% |
| 6 | 1370 | 5.5% |
| 2 | 879 | 3.5% |
| 3 | 785 | 3.1% |
| 7 | 305 | 1.2% |
| 8 | 50 | 0.2% |
| 4 | 10 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 25012 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 15790 | |
| 1 | 3052 | 12.2% |
| 5 | 2771 | 11.1% |
| 6 | 1370 | 5.5% |
| 2 | 879 | 3.5% |
| 3 | 785 | 3.1% |
| 7 | 305 | 1.2% |
| 8 | 50 | 0.2% |
| 4 | 10 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 25012 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 15790 | |
| 1 | 3052 | 12.2% |
| 5 | 2771 | 11.1% |
| 6 | 1370 | 5.5% |
| 2 | 879 | 3.5% |
| 3 | 785 | 3.1% |
| 7 | 305 | 1.2% |
| 8 | 50 | 0.2% |
| 4 | 10 | < 0.1% |
Missing 
| Distinct | 3 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 4515658 |
| Missing (%) | > 99.9% |
| Memory size | 34.5 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Unique
| Unique | 3 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 315 |
|---|---|
| 2nd row | 112 |
| 3rd row | 151 |
| Value | Count | Frequency (%) |
| 315 | 1 | |
| 112 | 1 | |
| 151 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 5 | |
| 5 | 2 | 22.2% |
| 3 | 1 | 11.1% |
| 2 | 1 | 11.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 9 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 5 | |
| 5 | 2 | 22.2% |
| 3 | 1 | 11.1% |
| 2 | 1 | 11.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 9 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 5 | |
| 5 | 2 | 22.2% |
| 3 | 1 | 11.1% |
| 2 | 1 | 11.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 5 | |
| 5 | 2 | 22.2% |
| 3 | 1 | 11.1% |
| 2 | 1 | 11.1% |
Missing 
| Distinct | 5 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 4515656 |
| Missing (%) | > 99.9% |
| Memory size | 34.5 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 3 |
| Mean length | 6.2 |
| Min length | 3 |
Unique
| Unique | 5 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 315 |
|---|---|
| 2nd row | United States |
| 3rd row | Indonesia |
| 4th row | 112 |
| 5th row | 151 |
| Value | Count | Frequency (%) |
| 315 | 1 | |
| united | 1 | |
| states | 1 | |
| indonesia | 1 | |
| 112 | 1 | |
| 151 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 5 | |
| n | 3 | |
| t | 3 | |
| e | 3 | |
| 5 | 2 | 6.5% |
| i | 2 | 6.5% |
| d | 2 | 6.5% |
| a | 2 | 6.5% |
| s | 2 | 6.5% |
| 3 | 1 | 3.2% |
| Other values (6) | 6 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 18 | |
| Decimal Number | 9 | |
| Uppercase Letter | 3 | 9.7% |
| Space Separator | 1 | 3.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 3 | |
| t | 3 | |
| e | 3 | |
| i | 2 | |
| d | 2 | |
| a | 2 | |
| s | 2 | |
| o | 1 | 5.6% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 5 | |
| 5 | 2 | 22.2% |
| 3 | 1 | 11.1% |
| 2 | 1 | 11.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 1 | |
| S | 1 | |
| I | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 21 | |
| Common | 10 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| n | 3 | |
| t | 3 | |
| e | 3 | |
| i | 2 | |
| d | 2 | |
| a | 2 | |
| s | 2 | |
| U | 1 | 4.8% |
| S | 1 | 4.8% |
| I | 1 | 4.8% |
Common
| Value | Count | Frequency (%) |
| 1 | 5 | |
| 5 | 2 | 20.0% |
| 3 | 1 | 10.0% |
| 1 | 10.0% | |
| 2 | 1 | 10.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 31 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 5 | |
| n | 3 | |
| t | 3 | |
| e | 3 | |
| 5 | 2 | 6.5% |
| i | 2 | 6.5% |
| d | 2 | 6.5% |
| a | 2 | 6.5% |
| s | 2 | 6.5% |
| 3 | 1 | 3.2% |
| Other values (6) | 6 |
Missing 
| Distinct | 4 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 4515657 |
| Missing (%) | > 99.9% |
| Memory size | 34.5 MiB |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Unique
| Unique | 4 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 1938 |
|---|---|
| 2nd row | 1988 |
| 3rd row | 1883 |
| 4th row | 1907 |
| Value | Count | Frequency (%) |
| 1938 | 1 | |
| 1988 | 1 | |
| 1883 | 1 | |
| 1907 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 8 | 5 | |
| 1 | 4 | |
| 9 | 3 | |
| 3 | 2 | 12.5% |
| 0 | 1 | 6.2% |
| 7 | 1 | 6.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 16 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 8 | 5 | |
| 1 | 4 | |
| 9 | 3 | |
| 3 | 2 | 12.5% |
| 0 | 1 | 6.2% |
| 7 | 1 | 6.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 16 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 8 | 5 | |
| 1 | 4 | |
| 9 | 3 | |
| 3 | 2 | 12.5% |
| 0 | 1 | 6.2% |
| 7 | 1 | 6.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 16 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 8 | 5 | |
| 1 | 4 | |
| 9 | 3 | |
| 3 | 2 | 12.5% |
| 0 | 1 | 6.2% |
| 7 | 1 | 6.2% |
verbatimLatitude
Text
Missing 
| Distinct | 5653 |
|---|---|
| Distinct (%) | 14.9% |
| Missing | 4477670 |
| Missing (%) | 99.2% |
| Memory size | 34.5 MiB |
Length
| Max length | 66 |
|---|---|
| Median length | 28 |
| Mean length | 8.207575478 |
| Min length | 1 |
Unique
| Unique | 2975 ? |
|---|---|
| Unique (%) | 7.8% |
Sample
| 1st row | 26 16'43"N |
|---|---|
| 2nd row | 24 58.74' N |
| 3rd row | 55 56'N |
| 4th row | 24 47'31"N |
| 5th row | 19.75856 |
| Value | Count | Frequency (%) |
| n | 20751 | |
| 0 | 7933 | 9.0% |
| 26 | 2620 | 3.0% |
| 24 | 2619 | 3.0% |
| 18 | 2373 | 2.7% |
| 16 | 2094 | 2.4% |
| 9 | 1686 | 1.9% |
| 25 | 1666 | 1.9% |
| s | 1527 | 1.7% |
| 2.2228 | 1275 | 1.5% |
| Other values (4210) | 43266 |
Most occurring characters
| Value | Count | Frequency (%) |
| 49819 | ||
| N | 29862 | |
| 2 | 27458 | |
| 3 | 24694 | 7.9% |
| 1 | 23158 | 7.4% |
| 0 | 22687 | 7.3% |
| 4 | 21103 | 6.8% |
| 5 | 17039 | 5.5% |
| 6 | 15718 | 5.0% |
| ' | 15536 | 5.0% |
| Other values (48) | 64740 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 185765 | |
| Space Separator | 49819 | 16.0% |
| Other Punctuation | 40475 | 13.0% |
| Uppercase Letter | 32869 | 10.5% |
| Dash Punctuation | 1489 | 0.5% |
| Lowercase Letter | 1036 | 0.3% |
| Other Symbol | 286 | 0.1% |
| Other Letter | 60 | < 0.1% |
| Math Symbol | 15 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 166 | |
| g | 145 | |
| d | 135 | |
| t | 133 | |
| a | 91 | |
| n | 75 | |
| r | 50 | 4.8% |
| o | 48 | 4.6% |
| i | 46 | 4.4% |
| h | 40 | 3.9% |
| Other values (9) | 107 |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 29862 | |
| S | 2903 | 8.8% |
| L | 71 | 0.2% |
| W | 11 | < 0.1% |
| E | 5 | < 0.1% |
| M | 4 | < 0.1% |
| T | 4 | < 0.1% |
| A | 3 | < 0.1% |
| X | 2 | < 0.1% |
| U | 1 | < 0.1% |
| Other values (3) | 3 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 27458 | |
| 3 | 24694 | |
| 1 | 23158 | |
| 0 | 22687 | |
| 4 | 21103 | |
| 5 | 17039 | |
| 6 | 15718 | |
| 8 | 13067 | |
| 9 | 11407 | |
| 7 | 9434 | 5.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 15536 | |
| . | 14598 | |
| " | 9732 | |
| ; | 450 | 1.1% |
| : | 77 | 0.2% |
| , | 42 | 0.1% |
| ? | 29 | 0.1% |
| / | 8 | < 0.1% |
| ′ | 3 | < 0.1% |
Math Symbol
| Value | Count | Frequency (%) |
| = | 11 | |
| ~ | 3 | 20.0% |
| + | 1 | 6.7% |
Space Separator
| Value | Count | Frequency (%) |
| 49819 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1489 |
Other Symbol
| Value | Count | Frequency (%) |
| ° | 286 |
Other Letter
| Value | Count | Frequency (%) |
| º | 60 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 277849 | |
| Latin | 33965 | 10.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| N | 29862 | |
| S | 2903 | 8.5% |
| e | 166 | 0.5% |
| g | 145 | 0.4% |
| d | 135 | 0.4% |
| t | 133 | 0.4% |
| a | 91 | 0.3% |
| n | 75 | 0.2% |
| L | 71 | 0.2% |
| º | 60 | 0.2% |
| Other values (23) | 324 | 1.0% |
Common
| Value | Count | Frequency (%) |
| 49819 | ||
| 2 | 27458 | |
| 3 | 24694 | |
| 1 | 23158 | |
| 0 | 22687 | |
| 4 | 21103 | |
| 5 | 17039 | 6.1% |
| 6 | 15718 | 5.7% |
| ' | 15536 | 5.6% |
| . | 14598 | 5.3% |
| Other values (15) | 46039 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 311465 | |
| None | 346 | 0.1% |
| Punctuation | 3 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 49819 | ||
| N | 29862 | |
| 2 | 27458 | |
| 3 | 24694 | 7.9% |
| 1 | 23158 | 7.4% |
| 0 | 22687 | 7.3% |
| 4 | 21103 | 6.8% |
| 5 | 17039 | 5.5% |
| 6 | 15718 | 5.0% |
| ' | 15536 | 5.0% |
| Other values (45) | 64391 |
None
| Value | Count | Frequency (%) |
| ° | 286 | |
| º | 60 | 17.3% |
Punctuation
| Value | Count | Frequency (%) |
| ′ | 3 |
Missing 
| Distinct | 5598 |
|---|---|
| Distinct (%) | 14.7% |
| Missing | 4477686 |
| Missing (%) | 99.2% |
| Memory size | 34.5 MiB |
Length
| Max length | 68 |
|---|---|
| Median length | 29 |
| Mean length | 8.566662278 |
| Min length | 1 |
Unique
| Unique | 2941 ? |
|---|---|
| Unique (%) | 7.7% |
Sample
| 1st row | 83 46'49"W |
|---|---|
| 2nd row | 76 12.75' W |
| 3rd row | 11 55'E |
| 4th row | 83 41'11"W |
| 5th row | -97.63925 |
| Value | Count | Frequency (%) |
| w | 11522 | 13.1% |
| e | 10550 | 12.0% |
| 0 | 7784 | 8.8% |
| 82 | 2500 | 2.8% |
| 88 | 1824 | 2.1% |
| 79 | 1618 | 1.8% |
| 83 | 1559 | 1.8% |
| 51 | 1335 | 1.5% |
| 9.91722 | 1275 | 1.4% |
| 48.5 | 1155 | 1.3% |
| Other values (4274) | 46881 |
Most occurring characters
| Value | Count | Frequency (%) |
| 50028 | ||
| 0 | 24928 | 7.7% |
| 1 | 24695 | 7.6% |
| 7 | 23700 | 7.3% |
| W | 20627 | 6.3% |
| 2 | 19465 | 6.0% |
| 4 | 19360 | 6.0% |
| 5 | 19032 | 5.9% |
| 8 | 18801 | 5.8% |
| 3 | 16059 | 4.9% |
| Other values (41) | 88624 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 195862 | |
| Space Separator | 50028 | 15.4% |
| Other Punctuation | 41461 | 12.7% |
| Uppercase Letter | 32895 | 10.1% |
| Dash Punctuation | 3709 | 1.1% |
| Lowercase Letter | 1034 | 0.3% |
| Other Symbol | 270 | 0.1% |
| Other Letter | 60 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| g | 218 | |
| e | 187 | |
| d | 135 | |
| n | 118 | |
| o | 77 | 7.4% |
| s | 63 | 6.1% |
| t | 48 | 4.6% |
| i | 47 | 4.5% |
| a | 38 | 3.7% |
| w | 35 | 3.4% |
| Other values (7) | 68 | 6.6% |
Uppercase Letter
| Value | Count | Frequency (%) |
| W | 20627 | |
| E | 11910 | |
| O | 142 | 0.4% |
| N | 120 | 0.4% |
| L | 84 | 0.3% |
| M | 3 | < 0.1% |
| T | 3 | < 0.1% |
| Y | 2 | < 0.1% |
| S | 2 | < 0.1% |
| Q | 1 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 24928 | |
| 1 | 24695 | |
| 7 | 23700 | |
| 2 | 19465 | |
| 4 | 19360 | |
| 5 | 19032 | |
| 8 | 18801 | |
| 3 | 16059 | |
| 9 | 15402 | |
| 6 | 14420 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 15592 | |
| ' | 15535 | |
| " | 9717 | |
| ; | 498 | 1.2% |
| , | 47 | 0.1% |
| : | 28 | 0.1% |
| ? | 26 | 0.1% |
| / | 15 | < 0.1% |
| ′ | 3 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 50028 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 3709 |
Other Symbol
| Value | Count | Frequency (%) |
| ° | 270 |
Other Letter
| Value | Count | Frequency (%) |
| º | 60 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 291330 | |
| Latin | 33989 | 10.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| W | 20627 | |
| E | 11910 | |
| g | 218 | 0.6% |
| e | 187 | 0.6% |
| O | 142 | 0.4% |
| d | 135 | 0.4% |
| N | 120 | 0.4% |
| n | 118 | 0.3% |
| L | 84 | 0.2% |
| o | 77 | 0.2% |
| Other values (19) | 371 | 1.1% |
Common
| Value | Count | Frequency (%) |
| 50028 | ||
| 0 | 24928 | |
| 1 | 24695 | |
| 7 | 23700 | 8.1% |
| 2 | 19465 | 6.7% |
| 4 | 19360 | 6.6% |
| 5 | 19032 | 6.5% |
| 8 | 18801 | 6.5% |
| 3 | 16059 | 5.5% |
| . | 15592 | 5.4% |
| Other values (12) | 59670 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 324986 | |
| None | 330 | 0.1% |
| Punctuation | 3 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 50028 | ||
| 0 | 24928 | 7.7% |
| 1 | 24695 | 7.6% |
| 7 | 23700 | 7.3% |
| W | 20627 | 6.3% |
| 2 | 19465 | 6.0% |
| 4 | 19360 | 6.0% |
| 5 | 19032 | 5.9% |
| 8 | 18801 | 5.8% |
| 3 | 16059 | 4.9% |
| Other values (38) | 88291 |
None
| Value | Count | Frequency (%) |
| ° | 270 | |
| º | 60 | 18.2% |
Punctuation
| Value | Count | Frequency (%) |
| ′ | 3 |
Missing 
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 4478628 |
| Missing (%) | 99.2% |
| Memory size | 34.5 MiB |
Length
| Max length | 23 |
|---|---|
| Median length | 23 |
| Mean length | 22.98007183 |
| Min length | 4 |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Degrees Minutes Seconds |
|---|---|
| 2nd row | Degrees Minutes Seconds |
| 3rd row | Degrees Minutes Seconds |
| 4th row | Degrees Minutes Seconds |
| 5th row | Degrees Minutes Seconds |
| Value | Count | Frequency (%) |
| degrees | 37003 | |
| minutes | 36978 | |
| seconds | 36978 | |
| decimal | 25 | < 0.1% |
| quad | 22 | < 0.1% |
| unknown | 6 | < 0.1% |
| 11 | 1 | < 0.1% |
| nov | 1 | < 0.1% |
| 1938 | 1 | < 0.1% |
| 21 | 1 | < 0.1% |
| Other values (2) | 2 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 184990 | |
| s | 110959 | |
| 73985 | 8.7% | |
| n | 73974 | 8.7% |
| D | 37025 | 4.4% |
| r | 37004 | 4.3% |
| i | 37003 | 4.3% |
| c | 37003 | 4.3% |
| d | 37003 | 4.3% |
| g | 37003 | 4.3% |
| Other values (21) | 185072 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 665969 | |
| Uppercase Letter | 111055 | 13.0% |
| Space Separator | 73985 | 8.7% |
| Decimal Number | 12 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 184990 | |
| s | 110959 | |
| n | 73974 | 11.1% |
| r | 37004 | 5.6% |
| i | 37003 | 5.6% |
| c | 37003 | 5.6% |
| d | 37003 | 5.6% |
| g | 37003 | 5.6% |
| o | 36985 | 5.6% |
| u | 36978 | 5.6% |
| Other values (8) | 37067 | 5.6% |
Uppercase Letter
| Value | Count | Frequency (%) |
| D | 37025 | |
| M | 36978 | |
| S | 36978 | |
| U | 28 | < 0.1% |
| A | 23 | < 0.1% |
| Q | 22 | < 0.1% |
| N | 1 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 5 | |
| 8 | 3 | |
| 9 | 2 | 16.7% |
| 3 | 1 | 8.3% |
| 2 | 1 | 8.3% |
Space Separator
| Value | Count | Frequency (%) |
| 73985 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 777024 | |
| Common | 73997 | 8.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 184990 | |
| s | 110959 | |
| n | 73974 | 9.5% |
| D | 37025 | 4.8% |
| r | 37004 | 4.8% |
| i | 37003 | 4.8% |
| c | 37003 | 4.8% |
| d | 37003 | 4.8% |
| g | 37003 | 4.8% |
| o | 36985 | 4.8% |
| Other values (15) | 148075 |
Common
| Value | Count | Frequency (%) |
| 73985 | ||
| 1 | 5 | < 0.1% |
| 8 | 3 | < 0.1% |
| 9 | 2 | < 0.1% |
| 3 | 1 | < 0.1% |
| 2 | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 851021 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 184990 | |
| s | 110959 | |
| 73985 | 8.7% | |
| n | 73974 | 8.7% |
| D | 37025 | 4.4% |
| r | 37004 | 4.3% |
| i | 37003 | 4.3% |
| c | 37003 | 4.3% |
| d | 37003 | 4.3% |
| g | 37003 | 4.3% |
| Other values (21) | 185072 |
verbatimSRS
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 4515660 |
| Missing (%) | > 99.9% |
| Memory size | 34.5 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 13 |
| Mean length | 13 |
| Min length | 13 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | San Francisco |
|---|
| Value | Count | Frequency (%) |
| san | 1 | |
| francisco | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 2 | |
| n | 2 | |
| c | 2 | |
| S | 1 | |
| 1 | ||
| F | 1 | |
| r | 1 | |
| i | 1 | |
| s | 1 | |
| o | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 10 | |
| Uppercase Letter | 2 | 15.4% |
| Space Separator | 1 | 7.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 2 | |
| n | 2 | |
| c | 2 | |
| r | 1 | |
| i | 1 | |
| s | 1 | |
| o | 1 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 1 | |
| F | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 12 | |
| Common | 1 | 7.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 2 | |
| n | 2 | |
| c | 2 | |
| S | 1 | |
| F | 1 | |
| r | 1 | |
| i | 1 | |
| s | 1 | |
| o | 1 |
Common
| Value | Count | Frequency (%) |
| 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 13 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 2 | |
| n | 2 | |
| c | 2 | |
| S | 1 | |
| 1 | ||
| F | 1 | |
| r | 1 | |
| i | 1 | |
| s | 1 | |
| o | 1 |
Missing 
| Distinct | 3 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 4515658 |
| Missing (%) | > 99.9% |
| Memory size | 34.5 MiB |
Length
| Max length | 33 |
|---|---|
| Median length | 19 |
| Mean length | 23 |
| Min length | 17 |
Unique
| Unique | 3 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Asplenium monanthes |
|---|---|
| 2nd row | Nymphoides indica |
| 3rd row | Kohleria inaequalis var. lindenii |
| Value | Count | Frequency (%) |
| asplenium | 1 | |
| monanthes | 1 | |
| nymphoides | 1 | |
| indica | 1 | |
| kohleria | 1 | |
| inaequalis | 1 | |
| var | 1 | |
| lindenii | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 10 | |
| n | 7 | 10.1% |
| a | 6 | 8.7% |
| e | 6 | 8.7% |
| 5 | 7.2% | |
| l | 4 | 5.8% |
| s | 4 | 5.8% |
| h | 3 | 4.3% |
| o | 3 | 4.3% |
| m | 3 | 4.3% |
| Other values (13) | 18 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 60 | |
| Space Separator | 5 | 7.2% |
| Uppercase Letter | 3 | 4.3% |
| Other Punctuation | 1 | 1.4% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 10 | |
| n | 7 | |
| a | 6 | |
| e | 6 | |
| l | 4 | 6.7% |
| s | 4 | 6.7% |
| h | 3 | 5.0% |
| o | 3 | 5.0% |
| m | 3 | 5.0% |
| d | 3 | 5.0% |
| Other values (8) | 11 |
Uppercase Letter
| Value | Count | Frequency (%) |
| K | 1 | |
| A | 1 | |
| N | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 5 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 63 | |
| Common | 6 | 8.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 10 | |
| n | 7 | |
| a | 6 | |
| e | 6 | |
| l | 4 | 6.3% |
| s | 4 | 6.3% |
| h | 3 | 4.8% |
| o | 3 | 4.8% |
| m | 3 | 4.8% |
| d | 3 | 4.8% |
| Other values (11) | 14 |
Common
| Value | Count | Frequency (%) |
| 5 | ||
| . | 1 | 16.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 69 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 10 | |
| n | 7 | 10.1% |
| a | 6 | 8.7% |
| e | 6 | 8.7% |
| 5 | 7.2% | |
| l | 4 | 5.8% |
| s | 4 | 5.8% |
| h | 3 | 4.3% |
| o | 3 | 4.3% |
| m | 3 | 4.3% |
| Other values (13) | 18 |
Missing 
| Distinct | 20 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 4388537 |
| Missing (%) | 97.2% |
| Memory size | 34.5 MiB |
Length
| Max length | 17 |
|---|---|
| Median length | 16 |
| Mean length | 8.340360593 |
| Min length | 3 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Gazetteer |
|---|---|
| 2nd row | Gazetteer |
| 3rd row | Gazetteer |
| 4th row | Gazetteer |
| 5th row | Label |
| Value | Count | Frequency (%) |
| gazetteer | 49173 | |
| gps | 23394 | |
| gis | 20969 | |
| arcview | 20969 | |
| label | 17064 | 10.3% |
| 15524 | 9.4% | |
| maps | 12428 | 7.5% |
| earth | 3096 | 1.9% |
| source | 1688 | 1.0% |
| g-1 | 397 | 0.2% |
| Other values (11) | 704 | 0.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 203052 | |
| G | 109414 | 10.3% |
| t | 101502 | 9.6% |
| a | 82086 | 7.7% |
| r | 74984 | 7.1% |
| z | 49173 | 4.6% |
| S | 45993 | 4.3% |
| 38282 | 3.6% | |
| o | 32954 | 3.1% |
| l | 32699 | 3.1% |
| Other values (30) | 290121 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 728213 | |
| Uppercase Letter | 251029 | 23.7% |
| Space Separator | 38282 | 3.6% |
| Close Punctuation | 20969 | 2.0% |
| Open Punctuation | 20969 | 2.0% |
| Dash Punctuation | 397 | < 0.1% |
| Decimal Number | 397 | < 0.1% |
| Other Punctuation | 4 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 203052 | |
| t | 101502 | |
| a | 82086 | |
| r | 74984 | 10.3% |
| z | 49173 | 6.8% |
| o | 32954 | 4.5% |
| l | 32699 | 4.5% |
| c | 22672 | 3.1% |
| i | 21073 | 2.9% |
| w | 20981 | 2.9% |
| Other values (12) | 87037 |
Uppercase Letter
| Value | Count | Frequency (%) |
| G | 109414 | |
| S | 45993 | |
| P | 23336 | 9.3% |
| I | 20969 | 8.4% |
| A | 20969 | 8.4% |
| L | 17057 | 6.8% |
| M | 9952 | 4.0% |
| E | 3096 | 1.2% |
| W | 176 | 0.1% |
| C | 55 | < 0.1% |
| Other values (2) | 12 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 38282 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 20969 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 20969 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 397 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 397 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 4 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 979242 | |
| Common | 81018 | 7.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 203052 | |
| G | 109414 | |
| t | 101502 | |
| a | 82086 | 8.4% |
| r | 74984 | 7.7% |
| z | 49173 | 5.0% |
| S | 45993 | 4.7% |
| o | 32954 | 3.4% |
| l | 32699 | 3.3% |
| P | 23336 | 2.4% |
| Other values (24) | 224049 |
Common
| Value | Count | Frequency (%) |
| 38282 | ||
| ) | 20969 | |
| ( | 20969 | |
| - | 397 | 0.5% |
| 1 | 397 | 0.5% |
| . | 4 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1060260 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 203052 | |
| G | 109414 | 10.3% |
| t | 101502 | 9.6% |
| a | 82086 | 7.7% |
| r | 74984 | 7.1% |
| z | 49173 | 4.6% |
| S | 45993 | 4.3% |
| 38282 | 3.6% | |
| o | 32954 | 3.1% |
| l | 32699 | 3.1% |
| Other values (30) | 290121 |
Missing 
| Distinct | 78 |
|---|---|
| Distinct (%) | 15.3% |
| Missing | 4515150 |
| Missing (%) | > 99.9% |
| Memory size | 34.5 MiB |
Length
| Max length | 67 |
|---|---|
| Median length | 57 |
| Mean length | 19.70254403 |
| Min length | 1 |
Unique
| Unique | 29 ? |
|---|---|
| Unique (%) | 5.7% |
Sample
| 1st row | +-1000m |
|---|---|
| 2nd row | stop 1 - beginning of bike path, along GW pkwy |
| 3rd row | ca.; ca. |
| 4th row | stop 1-ditch; stop 2- polkweed; stop 3; stop 4 |
| 5th row | Long. 4 8 W - 4 15 W |
| Value | Count | Frequency (%) |
| stop | 226 | 10.5% |
| 4 | 144 | 6.7% |
| 119 | 5.5% | |
| w | 106 | 4.9% |
| ca | 90 | 4.2% |
| 1 | 86 | 4.0% |
| seconds | 56 | 2.6% |
| long | 53 | 2.5% |
| 15 | 53 | 2.5% |
| 8 | 53 | 2.5% |
| Other values (116) | 1167 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1642 | ||
| o | 738 | 7.3% |
| t | 614 | 6.1% |
| e | 593 | 5.9% |
| a | 549 | 5.5% |
| n | 536 | 5.3% |
| i | 494 | 4.9% |
| s | 437 | 4.3% |
| p | 406 | 4.0% |
| l | 382 | 3.8% |
| Other values (54) | 3677 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 6599 | |
| Space Separator | 1642 | 16.3% |
| Uppercase Letter | 653 | 6.5% |
| Decimal Number | 614 | 6.1% |
| Other Punctuation | 341 | 3.4% |
| Dash Punctuation | 195 | 1.9% |
| Math Symbol | 20 | 0.2% |
| Close Punctuation | 2 | < 0.1% |
| Open Punctuation | 2 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 738 | |
| t | 614 | |
| e | 593 | |
| a | 549 | 8.3% |
| n | 536 | 8.1% |
| i | 494 | 7.5% |
| s | 437 | 6.6% |
| p | 406 | 6.2% |
| l | 382 | 5.8% |
| d | 372 | 5.6% |
| Other values (14) | 1478 |
Uppercase Letter
| Value | Count | Frequency (%) |
| W | 143 | |
| S | 113 | |
| L | 91 | |
| G | 47 | 7.2% |
| A | 36 | 5.5% |
| F | 28 | 4.3% |
| M | 28 | 4.3% |
| T | 28 | 4.3% |
| C | 26 | 4.0% |
| U | 25 | 3.8% |
| Other values (9) | 88 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 197 | |
| 4 | 156 | |
| 8 | 64 | 10.4% |
| 5 | 58 | 9.4% |
| 0 | 53 | 8.6% |
| 2 | 42 | 6.8% |
| 3 | 40 | 6.5% |
| 6 | 3 | 0.5% |
| 7 | 1 | 0.2% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 205 | |
| ; | 90 | |
| , | 33 | 9.7% |
| / | 10 | 2.9% |
| : | 2 | 0.6% |
| ' | 1 | 0.3% |
Math Symbol
| Value | Count | Frequency (%) |
| + | 17 | |
| ± | 3 | 15.0% |
Space Separator
| Value | Count | Frequency (%) |
| 1642 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 195 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 2 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 7252 | |
| Common | 2816 | 28.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 738 | 10.2% |
| t | 614 | 8.5% |
| e | 593 | 8.2% |
| a | 549 | 7.6% |
| n | 536 | 7.4% |
| i | 494 | 6.8% |
| s | 437 | 6.0% |
| p | 406 | 5.6% |
| l | 382 | 5.3% |
| d | 372 | 5.1% |
| Other values (33) | 2131 |
Common
| Value | Count | Frequency (%) |
| 1642 | ||
| . | 205 | 7.3% |
| 1 | 197 | 7.0% |
| - | 195 | 6.9% |
| 4 | 156 | 5.5% |
| ; | 90 | 3.2% |
| 8 | 64 | 2.3% |
| 5 | 58 | 2.1% |
| 0 | 53 | 1.9% |
| 2 | 42 | 1.5% |
| Other values (11) | 114 | 4.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 10065 | |
| None | 3 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1642 | ||
| o | 738 | 7.3% |
| t | 614 | 6.1% |
| e | 593 | 5.9% |
| a | 549 | 5.5% |
| n | 536 | 5.3% |
| i | 494 | 4.9% |
| s | 437 | 4.3% |
| p | 406 | 4.0% |
| l | 382 | 3.8% |
| Other values (53) | 3674 |
None
| Value | Count | Frequency (%) |
| ± | 3 |
Missing 
| Distinct | 4 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 4515657 |
| Missing (%) | > 99.9% |
| Memory size | 34.5 MiB |
Length
| Max length | 56 |
|---|---|
| Median length | 49 |
| Mean length | 49 |
| Min length | 42 |
Unique
| Unique | 4 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | South America - Neotropics, Colombia, Meta |
|---|---|
| 2nd row | South America - Neotropics, Ecuador, Morona-Santiago |
| 3rd row | North America, United States, California, San Bernardino |
| 4th row | North America, United States, Arizona, Cochise |
| Value | Count | Frequency (%) |
| america | 4 | |
| south | 2 | 8.0% |
| 2 | 8.0% | |
| neotropics | 2 | 8.0% |
| north | 2 | 8.0% |
| united | 2 | 8.0% |
| states | 2 | 8.0% |
| colombia | 1 | 4.0% |
| meta | 1 | 4.0% |
| ecuador | 1 | 4.0% |
| Other values (6) | 6 |
Most occurring characters
| Value | Count | Frequency (%) |
| 21 | 10.7% | |
| o | 18 | 9.2% |
| a | 17 | 8.7% |
| i | 15 | 7.7% |
| t | 14 | 7.1% |
| r | 14 | 7.1% |
| e | 13 | 6.6% |
| , | 10 | 5.1% |
| n | 9 | 4.6% |
| c | 8 | 4.1% |
| Other values (20) | 57 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 138 | |
| Uppercase Letter | 24 | 12.2% |
| Space Separator | 21 | 10.7% |
| Other Punctuation | 10 | 5.1% |
| Dash Punctuation | 3 | 1.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 18 | |
| a | 17 | |
| i | 15 | |
| t | 14 | |
| r | 14 | |
| e | 13 | |
| n | 9 | |
| c | 8 | 5.8% |
| m | 5 | 3.6% |
| h | 5 | 3.6% |
| Other values (9) | 20 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 6 | |
| A | 5 | |
| N | 4 | |
| C | 3 | |
| U | 2 | 8.3% |
| M | 2 | 8.3% |
| E | 1 | 4.2% |
| B | 1 | 4.2% |
Space Separator
| Value | Count | Frequency (%) |
| 21 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 10 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 3 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 162 | |
| Common | 34 | 17.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 18 | |
| a | 17 | 10.5% |
| i | 15 | 9.3% |
| t | 14 | 8.6% |
| r | 14 | 8.6% |
| e | 13 | 8.0% |
| n | 9 | 5.6% |
| c | 8 | 4.9% |
| S | 6 | 3.7% |
| m | 5 | 3.1% |
| Other values (17) | 43 |
Common
| Value | Count | Frequency (%) |
| 21 | ||
| , | 10 | |
| - | 3 | 8.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 196 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 21 | 10.7% | |
| o | 18 | 9.2% |
| a | 17 | 8.7% |
| i | 15 | 7.7% |
| t | 14 | 7.1% |
| r | 14 | 7.1% |
| e | 13 | 6.6% |
| , | 10 | 5.1% |
| n | 9 | 4.6% |
| c | 8 | 4.1% |
| Other values (20) | 57 |
earliestEonOrLowestEonothem
Text
Missing 
| Distinct | 5 |
|---|---|
| Distinct (%) | 71.4% |
| Missing | 4515654 |
| Missing (%) | > 99.9% |
| Memory size | 34.5 MiB |
Length
| Max length | 49 |
|---|---|
| Median length | 48 |
| Mean length | 31.57142857 |
| Min length | 13 |
Unique
| Unique | 3 ? |
|---|---|
| Unique (%) | 42.9% |
Sample
| 1st row | South America - Neotropics |
|---|---|
| 2nd row | Plantae, Pteridophyte, Polypodiales, Aspleniaceae |
| 3rd row | Plantae, Dicotyledonae, Asterales, Menyanthaceae |
| 4th row | Plantae, Dicotyledonae, Lamiales, Gesneriaceae |
| 5th row | South America - Neotropics |
| Value | Count | Frequency (%) |
| america | 4 | |
| plantae | 3 | |
| south | 2 | |
| 2 | ||
| neotropics | 2 | |
| north | 2 | |
| dicotyledonae | 2 | |
| pteridophyte | 1 | 4.2% |
| polypodiales | 1 | 4.2% |
| aspleniaceae | 1 | 4.2% |
| Other values (4) | 4 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 29 | |
| a | 23 | 10.4% |
| 17 | 7.7% | |
| o | 15 | 6.8% |
| t | 15 | 6.8% |
| i | 13 | 5.9% |
| c | 11 | 5.0% |
| r | 11 | 5.0% |
| l | 10 | 4.5% |
| , | 9 | 4.1% |
| Other values (17) | 68 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 171 | |
| Uppercase Letter | 22 | 10.0% |
| Space Separator | 17 | 7.7% |
| Other Punctuation | 9 | 4.1% |
| Dash Punctuation | 2 | 0.9% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 29 | |
| a | 23 | |
| o | 15 | |
| t | 15 | |
| i | 13 | |
| c | 11 | 6.4% |
| r | 11 | 6.4% |
| l | 10 | 5.8% |
| n | 9 | 5.3% |
| s | 8 | 4.7% |
| Other values (6) | 27 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 6 | |
| P | 5 | |
| N | 4 | |
| D | 2 | 9.1% |
| S | 2 | 9.1% |
| M | 1 | 4.5% |
| L | 1 | 4.5% |
| G | 1 | 4.5% |
Space Separator
| Value | Count | Frequency (%) |
| 17 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 9 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 193 | |
| Common | 28 | 12.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 29 | |
| a | 23 | |
| o | 15 | 7.8% |
| t | 15 | 7.8% |
| i | 13 | 6.7% |
| c | 11 | 5.7% |
| r | 11 | 5.7% |
| l | 10 | 5.2% |
| n | 9 | 4.7% |
| s | 8 | 4.1% |
| Other values (14) | 49 |
Common
| Value | Count | Frequency (%) |
| 17 | ||
| , | 9 | |
| - | 2 | 7.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 221 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 29 | |
| a | 23 | 10.4% |
| 17 | 7.7% | |
| o | 15 | 6.8% |
| t | 15 | 6.8% |
| i | 13 | 5.9% |
| c | 11 | 5.0% |
| r | 11 | 5.0% |
| l | 10 | 4.5% |
| , | 9 | 4.1% |
| Other values (17) | 68 |
latestEonOrHighestEonothem
Text
Missing 
| Distinct | 5 |
|---|---|
| Distinct (%) | 71.4% |
| Missing | 4515654 |
| Missing (%) | > 99.9% |
| Memory size | 34.5 MiB |
Length
| Max length | 27 |
|---|---|
| Median length | 13 |
| Mean length | 12.14285714 |
| Min length | 7 |
Unique
| Unique | 4 ? |
|---|---|
| Unique (%) | 57.1% |
Sample
| 1st row | Algae name updating Project |
|---|---|
| 2nd row | Earle, S. A. |
| 3rd row | Plantae |
| 4th row | Plantae |
| 5th row | Blair, S. M. |
| Value | Count | Frequency (%) |
| plantae | 3 | |
| s | 2 | |
| algae | 1 | 6.2% |
| name | 1 | 6.2% |
| updating | 1 | 6.2% |
| project | 1 | 6.2% |
| earle | 1 | 6.2% |
| a | 1 | 6.2% |
| blair | 1 | 6.2% |
| m | 1 | 6.2% |
| Other values (3) | 3 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 11 | |
| 9 | 10.6% | |
| e | 8 | 9.4% |
| n | 7 | 8.2% |
| l | 6 | 7.1% |
| . | 6 | 7.1% |
| t | 5 | 5.9% |
| P | 4 | 4.7% |
| , | 3 | 3.5% |
| r | 3 | 3.5% |
| Other values (17) | 23 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 53 | |
| Uppercase Letter | 14 | 16.5% |
| Space Separator | 9 | 10.6% |
| Other Punctuation | 9 | 10.6% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 11 | |
| e | 8 | |
| n | 7 | |
| l | 6 | |
| t | 5 | |
| r | 3 | 5.7% |
| i | 3 | 5.7% |
| g | 2 | 3.8% |
| p | 1 | 1.9% |
| d | 1 | 1.9% |
| Other values (6) | 6 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 4 | |
| S | 2 | |
| A | 2 | |
| D | 2 | |
| E | 1 | 7.1% |
| B | 1 | 7.1% |
| M | 1 | 7.1% |
| G | 1 | 7.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 6 | |
| , | 3 |
Space Separator
| Value | Count | Frequency (%) |
| 9 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 67 | |
| Common | 18 | 21.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 11 | |
| e | 8 | |
| n | 7 | |
| l | 6 | 9.0% |
| t | 5 | 7.5% |
| P | 4 | 6.0% |
| r | 3 | 4.5% |
| i | 3 | 4.5% |
| g | 2 | 3.0% |
| S | 2 | 3.0% |
| Other values (14) | 16 |
Common
| Value | Count | Frequency (%) |
| 9 | ||
| . | 6 | |
| , | 3 | 16.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 85 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 11 | |
| 9 | 10.6% | |
| e | 8 | 9.4% |
| n | 7 | 8.2% |
| l | 6 | 7.1% |
| . | 6 | 7.1% |
| t | 5 | 5.9% |
| P | 4 | 4.7% |
| , | 3 | 3.5% |
| r | 3 | 3.5% |
| Other values (17) | 23 |
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 66.7% |
| Missing | 4515658 |
| Missing (%) | > 99.9% |
| Memory size | 34.5 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 13 |
| Mean length | 12.66666667 |
| Min length | 12 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 33.3% |
Sample
| 1st row | Pteridophyte |
|---|---|
| 2nd row | Dicotyledonae |
| 3rd row | Dicotyledonae |
| Value | Count | Frequency (%) |
| dicotyledonae | 2 | |
| pteridophyte | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 6 | |
| o | 5 | |
| t | 4 | |
| i | 3 | |
| y | 3 | |
| d | 3 | |
| D | 2 | 5.3% |
| c | 2 | 5.3% |
| l | 2 | 5.3% |
| n | 2 | 5.3% |
| Other values (5) | 6 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 35 | |
| Uppercase Letter | 3 | 7.9% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 6 | |
| o | 5 | |
| t | 4 | |
| i | 3 | |
| y | 3 | |
| d | 3 | |
| c | 2 | 5.7% |
| l | 2 | 5.7% |
| n | 2 | 5.7% |
| a | 2 | 5.7% |
| Other values (3) | 3 |
Uppercase Letter
| Value | Count | Frequency (%) |
| D | 2 | |
| P | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 38 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 6 | |
| o | 5 | |
| t | 4 | |
| i | 3 | |
| y | 3 | |
| d | 3 | |
| D | 2 | 5.3% |
| c | 2 | 5.3% |
| l | 2 | 5.3% |
| n | 2 | 5.3% |
| Other values (5) | 6 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 38 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 6 | |
| o | 5 | |
| t | 4 | |
| i | 3 | |
| y | 3 | |
| d | 3 | |
| D | 2 | 5.3% |
| c | 2 | 5.3% |
| l | 2 | 5.3% |
| n | 2 | 5.3% |
| Other values (5) | 6 |
earliestPeriodOrLowestSystem
Text
Missing 
| Distinct | 6 |
|---|---|
| Distinct (%) | 85.7% |
| Missing | 4515654 |
| Missing (%) | > 99.9% |
| Memory size | 34.5 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 12 |
| Mean length | 10 |
| Min length | 7 |
Unique
| Unique | 5 ? |
|---|---|
| Unique (%) | 71.4% |
Sample
| 1st row | Colombia |
|---|---|
| 2nd row | Polypodiales |
| 3rd row | Asterales |
| 4th row | Lamiales |
| 5th row | Ecuador |
| Value | Count | Frequency (%) |
| united | 2 | |
| states | 2 | |
| colombia | 1 | |
| polypodiales | 1 | |
| asterales | 1 | |
| lamiales | 1 | |
| ecuador | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 8 | |
| a | 8 | |
| t | 7 | 10.0% |
| s | 6 | 8.6% |
| l | 5 | 7.1% |
| o | 5 | 7.1% |
| i | 5 | 7.1% |
| d | 4 | 5.7% |
| r | 2 | 2.9% |
| m | 2 | 2.9% |
| Other values (14) | 18 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 59 | |
| Uppercase Letter | 9 | 12.9% |
| Space Separator | 2 | 2.9% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 8 | |
| a | 8 | |
| t | 7 | |
| s | 6 | |
| l | 5 | |
| o | 5 | |
| i | 5 | |
| d | 4 | |
| r | 2 | 3.4% |
| m | 2 | 3.4% |
| Other values (6) | 7 |
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 2 | |
| S | 2 | |
| C | 1 | |
| P | 1 | |
| A | 1 | |
| L | 1 | |
| E | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 68 | |
| Common | 2 | 2.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 8 | |
| a | 8 | |
| t | 7 | |
| s | 6 | 8.8% |
| l | 5 | 7.4% |
| o | 5 | 7.4% |
| i | 5 | 7.4% |
| d | 4 | 5.9% |
| r | 2 | 2.9% |
| m | 2 | 2.9% |
| Other values (13) | 16 |
Common
| Value | Count | Frequency (%) |
| 2 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 70 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 8 | |
| a | 8 | |
| t | 7 | 10.0% |
| s | 6 | 8.6% |
| l | 5 | 7.1% |
| o | 5 | 7.1% |
| i | 5 | 7.1% |
| d | 4 | 5.7% |
| r | 2 | 2.9% |
| m | 2 | 2.9% |
| Other values (14) | 18 |
earliestEpochOrLowestSeries
Text
Missing 
| Distinct | 7 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 4515654 |
| Missing (%) | > 99.9% |
| Memory size | 34.5 MiB |
Length
| Max length | 15 |
|---|---|
| Median length | 12 |
| Mean length | 10.42857143 |
| Min length | 4 |
Unique
| Unique | 7 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Meta |
|---|---|
| 2nd row | Aspleniaceae |
| 3rd row | Menyanthaceae |
| 4th row | Gesneriaceae |
| 5th row | Morona-Santiago |
| Value | Count | Frequency (%) |
| meta | 1 | |
| aspleniaceae | 1 | |
| menyanthaceae | 1 | |
| gesneriaceae | 1 | |
| morona-santiago | 1 | |
| california | 1 | |
| arizona | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 14 | |
| e | 11 | |
| n | 8 | |
| i | 6 | |
| o | 5 | 6.8% |
| r | 4 | 5.5% |
| M | 3 | 4.1% |
| t | 3 | 4.1% |
| c | 3 | 4.1% |
| A | 2 | 2.7% |
| Other values (12) | 14 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 64 | |
| Uppercase Letter | 8 | 11.0% |
| Dash Punctuation | 1 | 1.4% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 14 | |
| e | 11 | |
| n | 8 | |
| i | 6 | |
| o | 5 | 7.8% |
| r | 4 | 6.2% |
| t | 3 | 4.7% |
| c | 3 | 4.7% |
| s | 2 | 3.1% |
| l | 2 | 3.1% |
| Other values (6) | 6 |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 3 | |
| A | 2 | |
| S | 1 | 12.5% |
| C | 1 | 12.5% |
| G | 1 | 12.5% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 72 | |
| Common | 1 | 1.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 14 | |
| e | 11 | |
| n | 8 | |
| i | 6 | |
| o | 5 | 6.9% |
| r | 4 | 5.6% |
| M | 3 | 4.2% |
| t | 3 | 4.2% |
| c | 3 | 4.2% |
| A | 2 | 2.8% |
| Other values (11) | 13 |
Common
| Value | Count | Frequency (%) |
| - | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 73 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 14 | |
| e | 11 | |
| n | 8 | |
| i | 6 | |
| o | 5 | 6.8% |
| r | 4 | 5.5% |
| M | 3 | 4.1% |
| t | 3 | 4.1% |
| c | 3 | 4.1% |
| A | 2 | 2.7% |
| Other values (12) | 14 |
latestEpochOrHighestSeries
Text
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 4515659 |
| Missing (%) | > 99.9% |
| Memory size | 34.5 MiB |
Length
| Max length | 14 |
|---|---|
| Median length | 10.5 |
| Mean length | 10.5 |
| Min length | 7 |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | San Bernardino |
|---|---|
| 2nd row | Cochise |
| Value | Count | Frequency (%) |
| san | 1 | |
| bernardino | 1 | |
| cochise | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 3 | |
| a | 2 | |
| e | 2 | |
| r | 2 | |
| i | 2 | |
| o | 2 | |
| S | 1 | 4.8% |
| 1 | 4.8% | |
| B | 1 | 4.8% |
| d | 1 | 4.8% |
| Other values (4) | 4 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 17 | |
| Uppercase Letter | 3 | 14.3% |
| Space Separator | 1 | 4.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 3 | |
| a | 2 | |
| e | 2 | |
| r | 2 | |
| i | 2 | |
| o | 2 | |
| d | 1 | 5.9% |
| c | 1 | 5.9% |
| h | 1 | 5.9% |
| s | 1 | 5.9% |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 1 | |
| B | 1 | |
| C | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 20 | |
| Common | 1 | 4.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| n | 3 | |
| a | 2 | |
| e | 2 | |
| r | 2 | |
| i | 2 | |
| o | 2 | |
| S | 1 | 5.0% |
| B | 1 | 5.0% |
| d | 1 | 5.0% |
| C | 1 | 5.0% |
| Other values (3) | 3 |
Common
| Value | Count | Frequency (%) |
| 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 21 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| n | 3 | |
| a | 2 | |
| e | 2 | |
| r | 2 | |
| i | 2 | |
| o | 2 | |
| S | 1 | 4.8% |
| 1 | 4.8% | |
| B | 1 | 4.8% |
| d | 1 | 4.8% |
| Other values (4) | 4 |
Missing 
| Distinct | 4 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 4515657 |
| Missing (%) | > 99.9% |
| Memory size | 34.5 MiB |
Length
| Max length | 84 |
|---|---|
| Median length | 46 |
| Mean length | 47.75 |
| Min length | 15 |
Unique
| Unique | 4 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Villa Vicencia. |
|---|---|
| 2nd row | Morona-Santiago: Cordillera del Boliche; about 60 km from Limon south to Gualaquiza. |
| 3rd row | Southern California. Lugonia. San Berdo Co. |
| 4th row | Chiricahua Mountains, Barfoot Park, stony knolls. |
| Value | Count | Frequency (%) |
| villa | 1 | 3.8% |
| vicencia | 1 | 3.8% |
| stony | 1 | 3.8% |
| park | 1 | 3.8% |
| barfoot | 1 | 3.8% |
| mountains | 1 | 3.8% |
| chiricahua | 1 | 3.8% |
| co | 1 | 3.8% |
| berdo | 1 | 3.8% |
| san | 1 | 3.8% |
| Other values (16) | 16 |
Most occurring characters
| Value | Count | Frequency (%) |
| 22 | 11.5% | |
| o | 20 | 10.5% |
| a | 19 | 9.9% |
| i | 14 | 7.3% |
| n | 12 | 6.3% |
| r | 10 | 5.2% |
| l | 10 | 5.2% |
| u | 8 | 4.2% |
| t | 8 | 4.2% |
| e | 6 | 3.1% |
| Other values (27) | 62 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 138 | |
| Space Separator | 22 | 11.5% |
| Uppercase Letter | 18 | 9.4% |
| Other Punctuation | 10 | 5.2% |
| Decimal Number | 2 | 1.0% |
| Dash Punctuation | 1 | 0.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 20 | |
| a | 19 | |
| i | 14 | |
| n | 12 | |
| r | 10 | 7.2% |
| l | 10 | 7.2% |
| u | 8 | 5.8% |
| t | 8 | 5.8% |
| e | 6 | 4.3% |
| h | 5 | 3.6% |
| Other values (11) | 26 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 4 | |
| B | 3 | |
| S | 3 | |
| L | 2 | |
| V | 2 | |
| M | 2 | |
| G | 1 | 5.6% |
| P | 1 | 5.6% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 6 | |
| , | 2 | 20.0% |
| ; | 1 | 10.0% |
| : | 1 | 10.0% |
Decimal Number
| Value | Count | Frequency (%) |
| 6 | 1 | |
| 0 | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 22 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 156 | |
| Common | 35 | 18.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 20 | |
| a | 19 | |
| i | 14 | 9.0% |
| n | 12 | 7.7% |
| r | 10 | 6.4% |
| l | 10 | 6.4% |
| u | 8 | 5.1% |
| t | 8 | 5.1% |
| e | 6 | 3.8% |
| h | 5 | 3.2% |
| Other values (19) | 44 |
Common
| Value | Count | Frequency (%) |
| 22 | ||
| . | 6 | 17.1% |
| , | 2 | 5.7% |
| 6 | 1 | 2.9% |
| 0 | 1 | 2.9% |
| ; | 1 | 2.9% |
| : | 1 | 2.9% |
| - | 1 | 2.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 191 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 22 | 11.5% | |
| o | 20 | 10.5% |
| a | 19 | 9.9% |
| i | 14 | 7.3% |
| n | 12 | 6.3% |
| r | 10 | 5.2% |
| l | 10 | 5.2% |
| u | 8 | 4.2% |
| t | 8 | 4.2% |
| e | 6 | 3.1% |
| Other values (27) | 62 |
lowestBiostratigraphicZone
Text
Missing 
| Distinct | 3 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 4515658 |
| Missing (%) | > 99.9% |
| Memory size | 34.5 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 9 |
| Mean length | 9 |
| Min length | 8 |
Unique
| Unique | 3 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Asplenium |
|---|---|
| 2nd row | Nymphoides |
| 3rd row | Kohleria |
| Value | Count | Frequency (%) |
| asplenium | 1 | |
| nymphoides | 1 | |
| kohleria | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 3 | |
| i | 3 | |
| m | 2 | 7.4% |
| h | 2 | 7.4% |
| p | 2 | 7.4% |
| l | 2 | 7.4% |
| s | 2 | 7.4% |
| o | 2 | 7.4% |
| r | 1 | 3.7% |
| K | 1 | 3.7% |
| Other values (7) | 7 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 24 | |
| Uppercase Letter | 3 | 11.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 3 | |
| i | 3 | |
| m | 2 | |
| h | 2 | |
| p | 2 | |
| l | 2 | |
| s | 2 | |
| o | 2 | |
| r | 1 | 4.2% |
| d | 1 | 4.2% |
| Other values (4) | 4 |
Uppercase Letter
| Value | Count | Frequency (%) |
| K | 1 | |
| A | 1 | |
| N | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 27 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 3 | |
| i | 3 | |
| m | 2 | 7.4% |
| h | 2 | 7.4% |
| p | 2 | 7.4% |
| l | 2 | 7.4% |
| s | 2 | 7.4% |
| o | 2 | 7.4% |
| r | 1 | 3.7% |
| K | 1 | 3.7% |
| Other values (7) | 7 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 27 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 3 | |
| i | 3 | |
| m | 2 | 7.4% |
| h | 2 | 7.4% |
| p | 2 | 7.4% |
| l | 2 | 7.4% |
| s | 2 | 7.4% |
| o | 2 | 7.4% |
| r | 1 | 3.7% |
| K | 1 | 3.7% |
| Other values (7) | 7 |
highestBiostratigraphicZone
Text
Missing 
| Distinct | 3 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 4515658 |
| Missing (%) | > 99.9% |
| Memory size | 34.5 MiB |
Length
| Max length | 6 |
|---|---|
| Median length | 6 |
| Mean length | 5.666666667 |
| Min length | 5 |
Unique
| Unique | 3 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 500.0 |
|---|---|
| 2nd row | 1650.0 |
| 3rd row | 2438.0 |
| Value | Count | Frequency (%) |
| 500.0 | 1 | |
| 1650.0 | 1 | |
| 2438.0 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 6 | |
| . | 3 | |
| 5 | 2 | 11.8% |
| 1 | 1 | 5.9% |
| 6 | 1 | 5.9% |
| 2 | 1 | 5.9% |
| 4 | 1 | 5.9% |
| 3 | 1 | 5.9% |
| 8 | 1 | 5.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 14 | |
| Other Punctuation | 3 | 17.6% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 6 | |
| 5 | 2 | 14.3% |
| 1 | 1 | 7.1% |
| 6 | 1 | 7.1% |
| 2 | 1 | 7.1% |
| 4 | 1 | 7.1% |
| 3 | 1 | 7.1% |
| 8 | 1 | 7.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 3 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 17 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 6 | |
| . | 3 | |
| 5 | 2 | 11.8% |
| 1 | 1 | 5.9% |
| 6 | 1 | 5.9% |
| 2 | 1 | 5.9% |
| 4 | 1 | 5.9% |
| 3 | 1 | 5.9% |
| 8 | 1 | 5.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 17 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 6 | |
| . | 3 | |
| 5 | 2 | 11.8% |
| 1 | 1 | 5.9% |
| 6 | 1 | 5.9% |
| 2 | 1 | 5.9% |
| 4 | 1 | 5.9% |
| 3 | 1 | 5.9% |
| 8 | 1 | 5.9% |
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 4515660 |
| Missing (%) | > 99.9% |
| Memory size | 34.5 MiB |
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 5 |
| Min length | 5 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 500.0 |
|---|
| Value | Count | Frequency (%) |
| 500.0 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 3 | |
| 5 | 1 | 20.0% |
| . | 1 | 20.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 4 | |
| Other Punctuation | 1 | 20.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 3 | |
| 5 | 1 | 25.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 5 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 3 | |
| 5 | 1 | 20.0% |
| . | 1 | 20.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 3 | |
| 5 | 1 | 20.0% |
| . | 1 | 20.0% |
formation
Text
Missing 
| Distinct | 3 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 4515658 |
| Missing (%) | > 99.9% |
| Memory size | 34.5 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 9 |
| Mean length | 8.333333333 |
| Min length | 6 |
Unique
| Unique | 3 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | monanthes |
|---|---|
| 2nd row | indica |
| 3rd row | inaequalis |
| Value | Count | Frequency (%) |
| monanthes | 1 | |
| indica | 1 | |
| inaequalis | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 4 | |
| a | 4 | |
| i | 4 | |
| e | 2 | |
| s | 2 | |
| m | 1 | 4.0% |
| o | 1 | 4.0% |
| t | 1 | 4.0% |
| h | 1 | 4.0% |
| d | 1 | 4.0% |
| Other values (4) | 4 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 25 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 4 | |
| a | 4 | |
| i | 4 | |
| e | 2 | |
| s | 2 | |
| m | 1 | 4.0% |
| o | 1 | 4.0% |
| t | 1 | 4.0% |
| h | 1 | 4.0% |
| d | 1 | 4.0% |
| Other values (4) | 4 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 25 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| n | 4 | |
| a | 4 | |
| i | 4 | |
| e | 2 | |
| s | 2 | |
| m | 1 | 4.0% |
| o | 1 | 4.0% |
| t | 1 | 4.0% |
| h | 1 | 4.0% |
| d | 1 | 4.0% |
| Other values (4) | 4 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 25 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| n | 4 | |
| a | 4 | |
| i | 4 | |
| e | 2 | |
| s | 2 | |
| m | 1 | 4.0% |
| o | 1 | 4.0% |
| t | 1 | 4.0% |
| h | 1 | 4.0% |
| d | 1 | 4.0% |
| Other values (4) | 4 |
member
Text
Missing 
| Distinct | 7 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 4515654 |
| Missing (%) | > 99.9% |
| Memory size | 34.5 MiB |
Length
| Max length | 32 |
|---|---|
| Median length | 19 |
| Mean length | 19.28571429 |
| Min length | 8 |
Unique
| Unique | 7 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Rinorea pubiflora var. pubiflora |
|---|---|
| 2nd row | Agissea simulans |
| 3rd row | Colpomenia sinuosa |
| 4th row | Laurencia intricata |
| 5th row | Avrainvillea nigricans |
| Value | Count | Frequency (%) |
| pubiflora | 2 | |
| rinorea | 1 | 6.7% |
| var | 1 | 6.7% |
| agissea | 1 | 6.7% |
| simulans | 1 | 6.7% |
| colpomenia | 1 | 6.7% |
| sinuosa | 1 | 6.7% |
| laurencia | 1 | 6.7% |
| intricata | 1 | 6.7% |
| avrainvillea | 1 | 6.7% |
| Other values (4) | 4 |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 20 | |
| a | 16 | |
| n | 14 | |
| e | 10 | 7.4% |
| s | 8 | 5.9% |
| r | 8 | 5.9% |
| 8 | 5.9% | |
| o | 7 | 5.2% |
| l | 7 | 5.2% |
| u | 5 | 3.7% |
| Other values (16) | 32 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 120 | |
| Space Separator | 8 | 5.9% |
| Uppercase Letter | 6 | 4.4% |
| Other Punctuation | 1 | 0.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 20 | |
| a | 16 | |
| n | 14 | |
| e | 10 | |
| s | 8 | 6.7% |
| r | 8 | 6.7% |
| o | 7 | 5.8% |
| l | 7 | 5.8% |
| u | 5 | 4.2% |
| m | 4 | 3.3% |
| Other values (10) | 21 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 3 | |
| R | 1 | 16.7% |
| L | 1 | 16.7% |
| C | 1 | 16.7% |
Space Separator
| Value | Count | Frequency (%) |
| 8 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 126 | |
| Common | 9 | 6.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 20 | |
| a | 16 | |
| n | 14 | |
| e | 10 | 7.9% |
| s | 8 | 6.3% |
| r | 8 | 6.3% |
| o | 7 | 5.6% |
| l | 7 | 5.6% |
| u | 5 | 4.0% |
| m | 4 | 3.2% |
| Other values (14) | 27 |
Common
| Value | Count | Frequency (%) |
| 8 | ||
| . | 1 | 11.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 135 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 20 | |
| a | 16 | |
| n | 14 | |
| e | 10 | 7.4% |
| s | 8 | 5.9% |
| r | 8 | 5.9% |
| 8 | 5.9% | |
| o | 7 | 5.2% |
| l | 7 | 5.2% |
| u | 5 | 3.7% |
| Other values (16) | 32 |
bed
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 4515660 |
| Missing (%) | > 99.9% |
| Memory size | 34.5 MiB |
Length
| Max length | 17 |
|---|---|
| Median length | 17 |
| Mean length | 17 |
| Min length | 17 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Riccardia pinguis |
|---|
| Value | Count | Frequency (%) |
| riccardia | 1 | |
| pinguis | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 4 | |
| c | 2 | |
| a | 2 | |
| R | 1 | 5.9% |
| r | 1 | 5.9% |
| d | 1 | 5.9% |
| 1 | 5.9% | |
| p | 1 | 5.9% |
| n | 1 | 5.9% |
| g | 1 | 5.9% |
| Other values (2) | 2 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 15 | |
| Uppercase Letter | 1 | 5.9% |
| Space Separator | 1 | 5.9% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 4 | |
| c | 2 | |
| a | 2 | |
| r | 1 | 6.7% |
| d | 1 | 6.7% |
| p | 1 | 6.7% |
| n | 1 | 6.7% |
| g | 1 | 6.7% |
| u | 1 | 6.7% |
| s | 1 | 6.7% |
Uppercase Letter
| Value | Count | Frequency (%) |
| R | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 16 | |
| Common | 1 | 5.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 4 | |
| c | 2 | |
| a | 2 | |
| R | 1 | 6.2% |
| r | 1 | 6.2% |
| d | 1 | 6.2% |
| p | 1 | 6.2% |
| n | 1 | 6.2% |
| g | 1 | 6.2% |
| u | 1 | 6.2% |
Common
| Value | Count | Frequency (%) |
| 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 17 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 4 | |
| c | 2 | |
| a | 2 | |
| R | 1 | 5.9% |
| r | 1 | 5.9% |
| d | 1 | 5.9% |
| 1 | 5.9% | |
| p | 1 | 5.9% |
| n | 1 | 5.9% |
| g | 1 | 5.9% |
| Other values (2) | 2 |
identificationID
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 4515660 |
| Missing (%) | > 99.9% |
| Memory size | 34.5 MiB |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 7 |
| Min length | 7 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Variety |
|---|
| Value | Count | Frequency (%) |
| variety | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| V | 1 | |
| a | 1 | |
| r | 1 | |
| i | 1 | |
| e | 1 | |
| t | 1 | |
| y | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 6 | |
| Uppercase Letter | 1 | 14.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 1 | |
| r | 1 | |
| i | 1 | |
| e | 1 | |
| t | 1 | |
| y | 1 |
Uppercase Letter
| Value | Count | Frequency (%) |
| V | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 7 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| V | 1 | |
| a | 1 | |
| r | 1 | |
| i | 1 | |
| e | 1 | |
| t | 1 | |
| y | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 7 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| V | 1 | |
| a | 1 | |
| r | 1 | |
| i | 1 | |
| e | 1 | |
| t | 1 | |
| y | 1 |
Missing 
| Distinct | 20 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 4504655 |
| Missing (%) | 99.8% |
| Memory size | 34.5 MiB |
Length
| Max length | 31 |
|---|---|
| Median length | 3 |
| Mean length | 4.359985462 |
| Min length | 2 |
Unique
| Unique | 3 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | cf. |
|---|---|
| 2nd row | cf. |
| 3rd row | cf. |
| 4th row | vel aff. |
| 5th row | vel aff. |
| Value | Count | Frequency (%) |
| cf | 5895 | |
| aff | 2849 | |
| uncertain | 1610 | 14.0% |
| s.l | 543 | 4.7% |
| vel | 347 | 3.0% |
| near | 76 | 0.7% |
| sp | 64 | 0.6% |
| nov | 42 | 0.4% |
| s.s | 27 | 0.2% |
| l | 2 | < 0.1% |
| Other values (7) | 7 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| f | 11593 | |
| . | 9933 | |
| c | 7505 | |
| a | 4536 | 9.5% |
| n | 3340 | 7.0% |
| e | 2034 | 4.2% |
| r | 1686 | 3.5% |
| t | 1613 | 3.4% |
| i | 1611 | 3.4% |
| u | 1601 | 3.3% |
| Other values (19) | 2534 | 5.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 37572 | |
| Other Punctuation | 9934 | 20.7% |
| Space Separator | 456 | 1.0% |
| Uppercase Letter | 20 | < 0.1% |
| Open Punctuation | 2 | < 0.1% |
| Close Punctuation | 2 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| f | 11593 | |
| c | 7505 | |
| a | 4536 | 12.1% |
| n | 3340 | 8.9% |
| e | 2034 | 5.4% |
| r | 1686 | 4.5% |
| t | 1613 | 4.3% |
| i | 1611 | 4.3% |
| u | 1601 | 4.3% |
| l | 890 | 2.4% |
| Other values (7) | 1163 | 3.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 10 | |
| L | 4 | 20.0% |
| K | 2 | 10.0% |
| H | 1 | 5.0% |
| P | 1 | 5.0% |
| E | 1 | 5.0% |
| S | 1 | 5.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 9933 | |
| & | 1 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 456 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 2 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 37592 | |
| Common | 10394 | 21.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| f | 11593 | |
| c | 7505 | |
| a | 4536 | 12.1% |
| n | 3340 | 8.9% |
| e | 2034 | 5.4% |
| r | 1686 | 4.5% |
| t | 1613 | 4.3% |
| i | 1611 | 4.3% |
| u | 1601 | 4.3% |
| l | 890 | 2.4% |
| Other values (14) | 1183 | 3.1% |
Common
| Value | Count | Frequency (%) |
| . | 9933 | |
| 456 | 4.4% | |
| ( | 2 | < 0.1% |
| ) | 2 | < 0.1% |
| & | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 47986 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| f | 11593 | |
| . | 9933 | |
| c | 7505 | |
| a | 4536 | 9.5% |
| n | 3340 | 7.0% |
| e | 2034 | 4.2% |
| r | 1686 | 3.5% |
| t | 1613 | 3.4% |
| i | 1611 | 3.4% |
| u | 1601 | 3.3% |
| Other values (19) | 2534 | 5.3% |
typeStatus
Text
Missing 
| Distinct | 192 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 4399315 |
| Missing (%) | 97.4% |
| Memory size | 34.5 MiB |
Length
| Max length | 60 |
|---|---|
| Median length | 7 |
| Mean length | 8.824067867 |
| Min length | 4 |
Unique
| Unique | 70 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | Isotype |
|---|---|
| 2nd row | Isotype |
| 3rd row | Holotype |
| 4th row | Type Collection |
| 5th row | Type Collection |
| Value | Count | Frequency (%) |
| isotype | 61603 | |
| holotype | 19964 | 14.6% |
| type | 16845 | 12.3% |
| collection | 9779 | 7.2% |
| isosyntype | 7051 | 5.2% |
| syntype | 6301 | 4.6% |
| fragment | 5516 | 4.0% |
| isolectotype | 2869 | 2.1% |
| possible | 2534 | 1.9% |
| lectotype | 1378 | 1.0% |
| Other values (16) | 2653 | 1.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 141371 | |
| o | 139676 | |
| y | 131094 | |
| t | 121736 | |
| p | 117820 | |
| s | 84712 | |
| I | 71992 | |
| l | 46312 | 4.5% |
| n | 29146 | 2.8% |
| 20147 | 2.0% | |
| Other values (26) | 122639 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 868423 | |
| Uppercase Letter | 136492 | 13.3% |
| Space Separator | 20147 | 2.0% |
| Other Punctuation | 1581 | 0.2% |
| Open Punctuation | 1 | < 0.1% |
| Close Punctuation | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 141371 | |
| o | 139676 | |
| y | 131094 | |
| t | 121736 | |
| p | 117820 | |
| s | 84712 | |
| l | 46312 | 5.3% |
| n | 29146 | 3.4% |
| c | 14142 | 1.6% |
| i | 13202 | 1.5% |
| Other values (9) | 29212 | 3.4% |
Uppercase Letter
| Value | Count | Frequency (%) |
| I | 71992 | |
| H | 19964 | 14.6% |
| T | 16846 | 12.3% |
| C | 10393 | 7.6% |
| S | 6301 | 4.6% |
| F | 5516 | 4.0% |
| P | 2922 | 2.1% |
| L | 1378 | 1.0% |
| M | 703 | 0.5% |
| N | 275 | 0.2% |
| Other values (3) | 202 | 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 20147 |
Other Punctuation
| Value | Count | Frequency (%) |
| ; | 1581 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1004915 | |
| Common | 21730 | 2.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 141371 | |
| o | 139676 | |
| y | 131094 | |
| t | 121736 | |
| p | 117820 | |
| s | 84712 | |
| I | 71992 | |
| l | 46312 | 4.6% |
| n | 29146 | 2.9% |
| H | 19964 | 2.0% |
| Other values (22) | 101092 |
Common
| Value | Count | Frequency (%) |
| 20147 | ||
| ; | 1581 | 7.3% |
| ( | 1 | < 0.1% |
| ) | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1026645 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 141371 | |
| o | 139676 | |
| y | 131094 | |
| t | 121736 | |
| p | 117820 | |
| s | 84712 | |
| I | 71992 | |
| l | 46312 | 4.5% |
| n | 29146 | 2.8% |
| 20147 | 2.0% | |
| Other values (26) | 122639 |
identifiedBy
Text
Missing 
| Distinct | 8134 |
|---|---|
| Distinct (%) | 1.5% |
| Missing | 3958097 |
| Missing (%) | 87.7% |
| Memory size | 34.5 MiB |
Length
| Max length | 131 |
|---|---|
| Median length | 109 |
| Mean length | 37.72206957 |
| Min length | 2 |
Unique
| Unique | 2641 ? |
|---|---|
| Unique (%) | 0.5% |
Sample
| 1st row | Blair, S. M. |
|---|---|
| 2nd row | Acevedo-Rodríguez, P., (BOT), Smithsonian Institution - National Museum of Natural History (UNITED STATES) |
| 3rd row | Acevedo-Rodríguez, P., (BOT), Smithsonian Institution - National Museum of Natural History (UNITED STATES) |
| 4th row | Wagner, W. L., (BOT), Smithsonian Institution - National Museum of Natural History (UNITED STATES) |
| 5th row | Wagner, W. L., (BOT), Smithsonian Institution - National Museum of Natural History (UNITED STATES) |
| Value | Count | Frequency (%) |
| united | 136541 | 4.2% |
| states | 136495 | 4.2% |
| of | 126287 | 3.8% |
| 123596 | 3.8% | |
| national | 120671 | 3.7% |
| museum | 119573 | 3.6% |
| smithsonian | 118957 | 3.6% |
| natural | 118768 | 3.6% |
| history | 118661 | 3.6% |
| institution | 118645 | 3.6% |
| Other values (6317) | 2046516 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2727146 | 13.0% | |
| t | 1181664 | 5.6% |
| a | 1144451 | 5.4% |
| o | 1117247 | 5.3% |
| i | 1047010 | 5.0% |
| n | 1030069 | 4.9% |
| , | 907900 | 4.3% |
| . | 858290 | 4.1% |
| r | 853980 | 4.1% |
| e | 836095 | 4.0% |
| Other values (88) | 9328616 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 10959399 | |
| Uppercase Letter | 4749667 | |
| Space Separator | 2727146 | 13.0% |
| Other Punctuation | 1794634 | 8.5% |
| Open Punctuation | 323090 | 1.5% |
| Close Punctuation | 323090 | 1.5% |
| Dash Punctuation | 155421 | 0.7% |
| Decimal Number | 20 | < 0.1% |
| Currency Symbol | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 1181664 | |
| a | 1144451 | |
| o | 1117247 | |
| i | 1047010 | |
| n | 1030069 | |
| r | 853980 | |
| e | 836095 | |
| u | 685477 | 6.3% |
| s | 684648 | 6.2% |
| l | 499617 | 4.6% |
| Other values (36) | 1879141 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 615333 | |
| T | 580486 | |
| N | 453167 | 9.5% |
| E | 392588 | 8.3% |
| I | 286588 | 6.0% |
| A | 283630 | 6.0% |
| M | 267816 | 5.6% |
| D | 257036 | 5.4% |
| U | 221269 | 4.7% |
| H | 211199 | 4.4% |
| Other values (21) | 1180555 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 907900 | |
| . | 858290 | |
| ; | 26380 | 1.5% |
| " | 1134 | 0.1% |
| ' | 677 | < 0.1% |
| & | 187 | < 0.1% |
| ¡ | 40 | < 0.1% |
| / | 17 | < 0.1% |
| ? | 8 | < 0.1% |
| … | 1 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 8 | |
| 0 | 6 | |
| 9 | 4 | |
| 1 | 2 | 10.0% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 320857 | |
| [ | 2233 | 0.7% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 320857 | |
| ] | 2233 | 0.7% |
Space Separator
| Value | Count | Frequency (%) |
| 2727146 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 155421 |
Currency Symbol
| Value | Count | Frequency (%) |
| ¢ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 15709066 | |
| Common | 5323402 | 25.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 1181664 | 7.5% |
| a | 1144451 | 7.3% |
| o | 1117247 | 7.1% |
| i | 1047010 | 6.7% |
| n | 1030069 | 6.6% |
| r | 853980 | 5.4% |
| e | 836095 | 5.3% |
| u | 685477 | 4.4% |
| s | 684648 | 4.4% |
| S | 615333 | 3.9% |
| Other values (67) | 6513092 |
Common
| Value | Count | Frequency (%) |
| 2727146 | ||
| , | 907900 | 17.1% |
| . | 858290 | 16.1% |
| ( | 320857 | 6.0% |
| ) | 320857 | 6.0% |
| - | 155421 | 2.9% |
| ; | 26380 | 0.5% |
| [ | 2233 | < 0.1% |
| ] | 2233 | < 0.1% |
| " | 1134 | < 0.1% |
| Other values (11) | 951 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 21000075 | |
| None | 32392 | 0.2% |
| Punctuation | 1 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2727146 | 13.0% | |
| t | 1181664 | 5.6% |
| a | 1144451 | 5.4% |
| o | 1117247 | 5.3% |
| i | 1047010 | 5.0% |
| n | 1030069 | 4.9% |
| , | 907900 | 4.3% |
| . | 858290 | 4.1% |
| r | 853980 | 4.1% |
| e | 836095 | 4.0% |
| Other values (60) | 9296223 |
None
| Value | Count | Frequency (%) |
| í | 18313 | |
| á | 3710 | 11.5% |
| é | 3312 | 10.2% |
| ö | 1488 | 4.6% |
| ñ | 1400 | 4.3% |
| ü | 1088 | 3.4% |
| ó | 994 | 3.1% |
| ä | 828 | 2.6% |
| ú | 332 | 1.0% |
| ã | 297 | 0.9% |
| Other values (17) | 630 | 1.9% |
Punctuation
| Value | Count | Frequency (%) |
| … | 1 |
identifiedByID
Text
Missing 
| Distinct | 6 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 4515655 |
| Missing (%) | > 99.9% |
| Memory size | 34.5 MiB |
Length
| Max length | 71 |
|---|---|
| Median length | 65.5 |
| Mean length | 65.5 |
| Min length | 59 |
Unique
| Unique | 6 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Plantae, Dicotyledonae, Malpighiales, Violaceae, Violoideae |
|---|---|
| 2nd row | Plantae, Rhodophyta, Florideophyceae, Peyssonneliales, Peyssonneliaceae |
| 3rd row | Chromista, Ochrophyta, Phaeophyceae, Ectocarpales, Scytosiphonaceae |
| 4th row | Plantae, Rhodophyta, Florideophyceae, Ceramiales, Rhodomelaceae |
| 5th row | Plantae, Chlorophyta, Ulvophyceae, Bryopsidales, Dichotomosiphonaceae |
| Value | Count | Frequency (%) |
| plantae | 5 | |
| rhodophyta | 2 | 6.7% |
| florideophyceae | 2 | 6.7% |
| ulvophyceae | 2 | 6.7% |
| chlorophyta | 2 | 6.7% |
| ectocarpales | 1 | 3.3% |
| cladophorales | 1 | 3.3% |
| dichotomosiphonaceae | 1 | 3.3% |
| bryopsidales | 1 | 3.3% |
| rhodomelaceae | 1 | 3.3% |
| Other values (12) | 12 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 47 | |
| e | 47 | |
| o | 37 | 9.4% |
| l | 25 | 6.4% |
| , | 24 | 6.1% |
| 24 | 6.1% | |
| h | 23 | 5.9% |
| c | 17 | 4.3% |
| i | 16 | 4.1% |
| y | 16 | 4.1% |
| Other values (22) | 117 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 315 | |
| Uppercase Letter | 30 | 7.6% |
| Other Punctuation | 24 | 6.1% |
| Space Separator | 24 | 6.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 47 | |
| e | 47 | |
| o | 37 | |
| l | 25 | |
| h | 23 | |
| c | 17 | 5.4% |
| i | 16 | 5.1% |
| y | 16 | 5.1% |
| p | 16 | 5.1% |
| t | 15 | 4.8% |
| Other values (7) | 56 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 8 | |
| C | 5 | |
| R | 3 | 10.0% |
| V | 2 | 6.7% |
| F | 2 | 6.7% |
| D | 2 | 6.7% |
| U | 2 | 6.7% |
| B | 1 | 3.3% |
| S | 1 | 3.3% |
| E | 1 | 3.3% |
| Other values (3) | 3 | 10.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 24 |
Space Separator
| Value | Count | Frequency (%) |
| 24 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 345 | |
| Common | 48 | 12.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 47 | |
| e | 47 | |
| o | 37 | |
| l | 25 | 7.2% |
| h | 23 | 6.7% |
| c | 17 | 4.9% |
| i | 16 | 4.6% |
| y | 16 | 4.6% |
| p | 16 | 4.6% |
| t | 15 | 4.3% |
| Other values (20) | 86 |
Common
| Value | Count | Frequency (%) |
| , | 24 | |
| 24 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 393 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 47 | |
| e | 47 | |
| o | 37 | 9.4% |
| l | 25 | 6.4% |
| , | 24 | 6.1% |
| 24 | 6.1% | |
| h | 23 | 5.9% |
| c | 17 | 4.3% |
| i | 16 | 4.1% |
| y | 16 | 4.1% |
| Other values (22) | 117 |
dateIdentified
Text
Missing 
| Distinct | 3 |
|---|---|
| Distinct (%) | 42.9% |
| Missing | 4515654 |
| Missing (%) | > 99.9% |
| Memory size | 34.5 MiB |
Length
| Max length | 69 |
|---|---|
| Median length | 7 |
| Mean length | 16.14285714 |
| Min length | 7 |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | 28.6% |
Sample
| 1st row | Plantae |
|---|---|
| 2nd row | Plantae |
| 3rd row | Chromista |
| 4th row | Plantae |
| 5th row | Plantae |
| Value | Count | Frequency (%) |
| plantae | 6 | |
| chromista | 1 | 9.1% |
| marchantiophyta | 1 | 9.1% |
| jungermanniopsida | 1 | 9.1% |
| metzgeriales | 1 | 9.1% |
| aneuraceae | 1 | 9.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 21 | |
| e | 13 | |
| n | 11 | |
| t | 10 | 8.8% |
| l | 7 | 6.2% |
| P | 6 | 5.3% |
| i | 5 | 4.4% |
| r | 5 | 4.4% |
| 4 | 3.5% | |
| , | 4 | 3.5% |
| Other values (15) | 27 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 94 | |
| Uppercase Letter | 11 | 9.7% |
| Space Separator | 4 | 3.5% |
| Other Punctuation | 4 | 3.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 21 | |
| e | 13 | |
| n | 11 | |
| t | 10 | |
| l | 7 | 7.4% |
| i | 5 | 5.3% |
| r | 5 | 5.3% |
| s | 3 | 3.2% |
| o | 3 | 3.2% |
| h | 3 | 3.2% |
| Other values (8) | 13 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 6 | |
| M | 2 | 18.2% |
| C | 1 | 9.1% |
| J | 1 | 9.1% |
| A | 1 | 9.1% |
Space Separator
| Value | Count | Frequency (%) |
| 4 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 4 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 105 | |
| Common | 8 | 7.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 21 | |
| e | 13 | |
| n | 11 | |
| t | 10 | |
| l | 7 | 6.7% |
| P | 6 | 5.7% |
| i | 5 | 4.8% |
| r | 5 | 4.8% |
| s | 3 | 2.9% |
| o | 3 | 2.9% |
| Other values (13) | 21 |
Common
| Value | Count | Frequency (%) |
| 4 | ||
| , | 4 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 113 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 21 | |
| e | 13 | |
| n | 11 | |
| t | 10 | 8.8% |
| l | 7 | 6.2% |
| P | 6 | 5.3% |
| i | 5 | 4.4% |
| r | 5 | 4.4% |
| 4 | 3.5% | |
| , | 4 | 3.5% |
| Other values (15) | 27 |
Missing 
| Distinct | 4 |
|---|---|
| Distinct (%) | 66.7% |
| Missing | 4515655 |
| Missing (%) | > 99.9% |
| Memory size | 34.5 MiB |
Length
| Max length | 11 |
|---|---|
| Median length | 10.5 |
| Mean length | 9.833333333 |
| Min length | 7 |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | 33.3% |
Sample
| 1st row | Rhodophyta |
|---|---|
| 2nd row | Ochrophyta |
| 3rd row | Rhodophyta |
| 4th row | Chlorophyta |
| 5th row | Chlorophyta |
| Value | Count | Frequency (%) |
| rhodophyta | 2 | |
| chlorophyta | 2 | |
| ochrophyta | 1 | |
| plantae | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| h | 10 | |
| o | 9 | |
| a | 7 | |
| t | 6 | |
| p | 5 | |
| y | 5 | |
| l | 3 | 5.1% |
| r | 3 | 5.1% |
| R | 2 | 3.4% |
| d | 2 | 3.4% |
| Other values (6) | 7 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 53 | |
| Uppercase Letter | 6 | 10.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| h | 10 | |
| o | 9 | |
| a | 7 | |
| t | 6 | |
| p | 5 | |
| y | 5 | |
| l | 3 | 5.7% |
| r | 3 | 5.7% |
| d | 2 | 3.8% |
| c | 1 | 1.9% |
| Other values (2) | 2 | 3.8% |
Uppercase Letter
| Value | Count | Frequency (%) |
| R | 2 | |
| C | 2 | |
| O | 1 | |
| P | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 59 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| h | 10 | |
| o | 9 | |
| a | 7 | |
| t | 6 | |
| p | 5 | |
| y | 5 | |
| l | 3 | 5.1% |
| r | 3 | 5.1% |
| R | 2 | 3.4% |
| d | 2 | 3.4% |
| Other values (6) | 7 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 59 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| h | 10 | |
| o | 9 | |
| a | 7 | |
| t | 6 | |
| p | 5 | |
| y | 5 | |
| l | 3 | 5.1% |
| r | 3 | 5.1% |
| R | 2 | 3.4% |
| d | 2 | 3.4% |
| Other values (6) | 7 |
identificationVerificationStatus
Text
Missing 
| Distinct | 5 |
|---|---|
| Distinct (%) | 71.4% |
| Missing | 4515654 |
| Missing (%) | > 99.9% |
| Memory size | 34.5 MiB |
Length
| Max length | 15 |
|---|---|
| Median length | 13 |
| Mean length | 13.14285714 |
| Min length | 11 |
Unique
| Unique | 3 ? |
|---|---|
| Unique (%) | 42.9% |
Sample
| 1st row | Dicotyledonae |
|---|---|
| 2nd row | Florideophyceae |
| 3rd row | Phaeophyceae |
| 4th row | Florideophyceae |
| 5th row | Ulvophyceae |
| Value | Count | Frequency (%) |
| florideophyceae | 2 | |
| ulvophyceae | 2 | |
| dicotyledonae | 1 | |
| phaeophyceae | 1 | |
| marchantiophyta | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 15 | |
| o | 10 | |
| a | 10 | |
| h | 8 | |
| y | 7 | |
| c | 7 | |
| p | 6 | 6.5% |
| l | 5 | 5.4% |
| i | 4 | 4.3% |
| t | 3 | 3.3% |
| Other values (9) | 17 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 85 | |
| Uppercase Letter | 7 | 7.6% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 15 | |
| o | 10 | |
| a | 10 | |
| h | 8 | |
| y | 7 | |
| c | 7 | |
| p | 6 | 7.1% |
| l | 5 | 5.9% |
| i | 4 | 4.7% |
| t | 3 | 3.5% |
| Other values (4) | 10 |
Uppercase Letter
| Value | Count | Frequency (%) |
| F | 2 | |
| U | 2 | |
| D | 1 | |
| P | 1 | |
| M | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 92 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 15 | |
| o | 10 | |
| a | 10 | |
| h | 8 | |
| y | 7 | |
| c | 7 | |
| p | 6 | 6.5% |
| l | 5 | 5.4% |
| i | 4 | 4.3% |
| t | 3 | 3.3% |
| Other values (9) | 17 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 92 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 15 | |
| o | 10 | |
| a | 10 | |
| h | 8 | |
| y | 7 | |
| c | 7 | |
| p | 6 | 6.5% |
| l | 5 | 5.4% |
| i | 4 | 4.3% |
| t | 3 | 3.3% |
| Other values (9) | 17 |
Missing 
| Distinct | 7 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 4515654 |
| Missing (%) | > 99.9% |
| Memory size | 34.5 MiB |
Length
| Max length | 17 |
|---|---|
| Median length | 15 |
| Mean length | 13 |
| Min length | 10 |
Unique
| Unique | 7 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Malpighiales |
|---|---|
| 2nd row | Peyssonneliales |
| 3rd row | Ectocarpales |
| 4th row | Ceramiales |
| 5th row | Bryopsidales |
| Value | Count | Frequency (%) |
| malpighiales | 1 | |
| peyssonneliales | 1 | |
| ectocarpales | 1 | |
| ceramiales | 1 | |
| bryopsidales | 1 | |
| cladophorales | 1 | |
| jungermanniopsida | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 12 | |
| e | 10 | |
| s | 10 | |
| l | 9 | |
| i | 7 | 7.7% |
| o | 6 | 6.6% |
| p | 5 | 5.5% |
| r | 5 | 5.5% |
| n | 5 | 5.5% |
| d | 3 | 3.3% |
| Other values (13) | 19 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 84 | |
| Uppercase Letter | 7 | 7.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 12 | |
| e | 10 | |
| s | 10 | |
| l | 9 | |
| i | 7 | |
| o | 6 | |
| p | 5 | |
| r | 5 | |
| n | 5 | |
| d | 3 | 3.6% |
| Other values (7) | 12 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 2 | |
| B | 1 | |
| J | 1 | |
| M | 1 | |
| E | 1 | |
| P | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 91 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 12 | |
| e | 10 | |
| s | 10 | |
| l | 9 | |
| i | 7 | 7.7% |
| o | 6 | 6.6% |
| p | 5 | 5.5% |
| r | 5 | 5.5% |
| n | 5 | 5.5% |
| d | 3 | 3.3% |
| Other values (13) | 19 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 91 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 12 | |
| e | 10 | |
| s | 10 | |
| l | 9 | |
| i | 7 | 7.7% |
| o | 6 | 6.6% |
| p | 5 | 5.5% |
| r | 5 | 5.5% |
| n | 5 | 5.5% |
| d | 3 | 3.3% |
| Other values (13) | 19 |
taxonID
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 4515660 |
| Missing (%) | > 99.9% |
| Memory size | 34.5 MiB |
Length
| Max length | 12 |
|---|---|
| Median length | 12 |
| Mean length | 12 |
| Min length | 12 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Metzgeriales |
|---|
| Value | Count | Frequency (%) |
| metzgeriales | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 3 | |
| M | 1 | 8.3% |
| t | 1 | 8.3% |
| z | 1 | 8.3% |
| g | 1 | 8.3% |
| r | 1 | 8.3% |
| i | 1 | 8.3% |
| a | 1 | 8.3% |
| l | 1 | 8.3% |
| s | 1 | 8.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 11 | |
| Uppercase Letter | 1 | 8.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 3 | |
| t | 1 | 9.1% |
| z | 1 | 9.1% |
| g | 1 | 9.1% |
| r | 1 | 9.1% |
| i | 1 | 9.1% |
| a | 1 | 9.1% |
| l | 1 | 9.1% |
| s | 1 | 9.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 12 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 3 | |
| M | 1 | 8.3% |
| t | 1 | 8.3% |
| z | 1 | 8.3% |
| g | 1 | 8.3% |
| r | 1 | 8.3% |
| i | 1 | 8.3% |
| a | 1 | 8.3% |
| l | 1 | 8.3% |
| s | 1 | 8.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 12 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 3 | |
| M | 1 | 8.3% |
| t | 1 | 8.3% |
| z | 1 | 8.3% |
| g | 1 | 8.3% |
| r | 1 | 8.3% |
| i | 1 | 8.3% |
| a | 1 | 8.3% |
| l | 1 | 8.3% |
| s | 1 | 8.3% |
scientificNameID
Text
Missing 
| Distinct | 6 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 4515655 |
| Missing (%) | > 99.9% |
| Memory size | 34.5 MiB |
Length
| Max length | 20 |
|---|---|
| Median length | 15 |
| Mean length | 14.66666667 |
| Min length | 9 |
Unique
| Unique | 6 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Violaceae |
|---|---|
| 2nd row | Peyssonneliaceae |
| 3rd row | Scytosiphonaceae |
| 4th row | Rhodomelaceae |
| 5th row | Dichotomosiphonaceae |
| Value | Count | Frequency (%) |
| violaceae | 1 | |
| peyssonneliaceae | 1 | |
| scytosiphonaceae | 1 | |
| rhodomelaceae | 1 | |
| dichotomosiphonaceae | 1 | |
| anadyomenaceae | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 16 | |
| a | 13 | |
| o | 11 | |
| c | 8 | |
| n | 6 | 6.8% |
| i | 5 | 5.7% |
| s | 4 | 4.5% |
| h | 4 | 4.5% |
| l | 3 | 3.4% |
| y | 3 | 3.4% |
| Other values (10) | 15 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 82 | |
| Uppercase Letter | 6 | 6.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 16 | |
| a | 13 | |
| o | 11 | |
| c | 8 | |
| n | 6 | 7.3% |
| i | 5 | 6.1% |
| s | 4 | 4.9% |
| h | 4 | 4.9% |
| l | 3 | 3.7% |
| y | 3 | 3.7% |
| Other values (4) | 9 |
Uppercase Letter
| Value | Count | Frequency (%) |
| D | 1 | |
| V | 1 | |
| R | 1 | |
| S | 1 | |
| P | 1 | |
| A | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 88 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 16 | |
| a | 13 | |
| o | 11 | |
| c | 8 | |
| n | 6 | 6.8% |
| i | 5 | 5.7% |
| s | 4 | 4.5% |
| h | 4 | 4.5% |
| l | 3 | 3.4% |
| y | 3 | 3.4% |
| Other values (10) | 15 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 88 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 16 | |
| a | 13 | |
| o | 11 | |
| c | 8 | |
| n | 6 | 6.8% |
| i | 5 | 5.7% |
| s | 4 | 4.5% |
| h | 4 | 4.5% |
| l | 3 | 3.4% |
| y | 3 | 3.4% |
| Other values (10) | 15 |
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 4515660 |
| Missing (%) | > 99.9% |
| Memory size | 34.5 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Aneuraceae |
|---|
| Value | Count | Frequency (%) |
| aneuraceae | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 3 | |
| a | 2 | |
| A | 1 | 10.0% |
| n | 1 | 10.0% |
| u | 1 | 10.0% |
| r | 1 | 10.0% |
| c | 1 | 10.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 9 | |
| Uppercase Letter | 1 | 10.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 3 | |
| a | 2 | |
| n | 1 | 11.1% |
| u | 1 | 11.1% |
| r | 1 | 11.1% |
| c | 1 | 11.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 10 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 3 | |
| a | 2 | |
| A | 1 | 10.0% |
| n | 1 | 10.0% |
| u | 1 | 10.0% |
| r | 1 | 10.0% |
| c | 1 | 10.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 10 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 3 | |
| a | 2 | |
| A | 1 | 10.0% |
| n | 1 | 10.0% |
| u | 1 | 10.0% |
| r | 1 | 10.0% |
| c | 1 | 10.0% |
Missing 
| Distinct | 6 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 4515655 |
| Missing (%) | > 99.9% |
| Memory size | 34.5 MiB |
Length
| Max length | 12 |
|---|---|
| Median length | 10 |
| Mean length | 9.166666667 |
| Min length | 7 |
Unique
| Unique | 6 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Rinorea |
|---|---|
| 2nd row | Agissea |
| 3rd row | Colpomenia |
| 4th row | Laurencia |
| 5th row | Avrainvillea |
| Value | Count | Frequency (%) |
| rinorea | 1 | |
| agissea | 1 | |
| colpomenia | 1 | |
| laurencia | 1 | |
| avrainvillea | 1 | |
| anadyomene | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 8 | |
| e | 7 | |
| n | 6 | |
| i | 6 | |
| o | 4 | 7.3% |
| r | 3 | 5.5% |
| A | 3 | 5.5% |
| l | 3 | 5.5% |
| s | 2 | 3.6% |
| v | 2 | 3.6% |
| Other values (10) | 11 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 49 | |
| Uppercase Letter | 6 | 10.9% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 8 | |
| e | 7 | |
| n | 6 | |
| i | 6 | |
| o | 4 | |
| r | 3 | 6.1% |
| l | 3 | 6.1% |
| s | 2 | 4.1% |
| v | 2 | 4.1% |
| m | 2 | 4.1% |
| Other values (6) | 6 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 3 | |
| R | 1 | 16.7% |
| C | 1 | 16.7% |
| L | 1 | 16.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 55 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 8 | |
| e | 7 | |
| n | 6 | |
| i | 6 | |
| o | 4 | 7.3% |
| r | 3 | 5.5% |
| A | 3 | 5.5% |
| l | 3 | 5.5% |
| s | 2 | 3.6% |
| v | 2 | 3.6% |
| Other values (10) | 11 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 55 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 8 | |
| e | 7 | |
| n | 6 | |
| i | 6 | |
| o | 4 | 7.3% |
| r | 3 | 5.5% |
| A | 3 | 5.5% |
| l | 3 | 5.5% |
| s | 2 | 3.6% |
| v | 2 | 3.6% |
| Other values (10) | 11 |
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 4515660 |
| Missing (%) | > 99.9% |
| Memory size | 34.5 MiB |
Length
| Max length | 9 |
|---|---|
| Median length | 9 |
| Mean length | 9 |
| Min length | 9 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Riccardia |
|---|
| Value | Count | Frequency (%) |
| riccardia | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 2 | |
| c | 2 | |
| a | 2 | |
| R | 1 | |
| r | 1 | |
| d | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 8 | |
| Uppercase Letter | 1 | 11.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 2 | |
| c | 2 | |
| a | 2 | |
| r | 1 | |
| d | 1 |
Uppercase Letter
| Value | Count | Frequency (%) |
| R | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 9 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 2 | |
| c | 2 | |
| a | 2 | |
| R | 1 | |
| r | 1 | |
| d | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 2 | |
| c | 2 | |
| a | 2 | |
| R | 1 | |
| r | 1 | |
| d | 1 |
scientificName
Text
| Distinct | 330689 |
|---|---|
| Distinct (%) | 7.3% |
| Missing | 13722 |
| Missing (%) | 0.3% |
| Memory size | 34.5 MiB |
Length
| Max length | 136 |
|---|---|
| Median length | 98 |
| Mean length | 19.78998916 |
| Min length | 5 |
Unique
| Unique | 121029 ? |
|---|---|
| Unique (%) | 2.7% |
Sample
| 1st row | Lithothamnion calcareum |
|---|---|
| 2nd row | Amicia glandulosa |
| 3rd row | Tripogandra glandulosa |
| 4th row | Connarus steyermarkii |
| 5th row | Trichoneura grandiglumis |
| Value | Count | Frequency (%) |
| sp | 270265 | 2.8% |
| var | 210171 | 2.2% |
| subsp | 106168 | 1.1% |
| carex | 58129 | 0.6% |
| indet | 41306 | 0.4% |
| poa | 30191 | 0.3% |
| cyperus | 27842 | 0.3% |
| cladonia | 27025 | 0.3% |
| paspalum | 26142 | 0.3% |
| solanum | 24852 | 0.3% |
| Other values (98717) | 8921619 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 10712057 | 12.0% |
| i | 8471292 | 9.5% |
| e | 5756840 | 6.5% |
| s | 5576870 | 6.3% |
| r | 5526161 | 6.2% |
| 5241771 | 5.9% | |
| o | 5207769 | 5.8% |
| l | 4893187 | 5.5% |
| n | 4698467 | 5.3% |
| u | 4559281 | 5.1% |
| Other values (87) | 28449629 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 78580357 | |
| Space Separator | 5241771 | 5.9% |
| Uppercase Letter | 4541706 | 5.1% |
| Other Punctuation | 688084 | 0.8% |
| Dash Punctuation | 21295 | < 0.1% |
| Decimal Number | 7089 | < 0.1% |
| Open Punctuation | 6499 | < 0.1% |
| Close Punctuation | 6499 | < 0.1% |
| Math Symbol | 24 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 10712057 | |
| i | 8471292 | |
| e | 5756840 | 7.3% |
| s | 5576870 | 7.1% |
| r | 5526161 | 7.0% |
| o | 5207769 | 6.6% |
| l | 4893187 | 6.2% |
| n | 4698467 | 6.0% |
| u | 4559281 | 5.8% |
| t | 3890322 | 5.0% |
| Other values (29) | 19288111 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 644736 | |
| P | 590845 | |
| S | 445961 | |
| A | 406199 | 8.9% |
| M | 289296 | 6.4% |
| L | 244144 | 5.4% |
| E | 239459 | 5.3% |
| D | 213054 | 4.7% |
| B | 194553 | 4.3% |
| H | 183912 | 4.0% |
| Other values (19) | 1089547 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 675591 | |
| , | 5059 | 0.7% |
| ' | 4244 | 0.6% |
| & | 2296 | 0.3% |
| ? | 530 | 0.1% |
| " | 206 | < 0.1% |
| / | 145 | < 0.1% |
| # | 11 | < 0.1% |
| § | 1 | < 0.1% |
| * | 1 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 2215 | |
| 0 | 1692 | |
| 1 | 1572 | |
| 5 | 1049 | |
| 9 | 142 | 2.0% |
| 3 | 125 | 1.8% |
| 7 | 110 | 1.6% |
| 8 | 103 | 1.5% |
| 6 | 52 | 0.7% |
| 4 | 29 | 0.4% |
Math Symbol
| Value | Count | Frequency (%) |
| × | 20 | |
| ~ | 3 | 12.5% |
| + | 1 | 4.2% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 6483 | |
| [ | 16 | 0.2% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 6483 | |
| ] | 16 | 0.2% |
Space Separator
| Value | Count | Frequency (%) |
| 5241771 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 21295 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 83122063 | |
| Common | 5971261 | 6.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 10712057 | |
| i | 8471292 | 10.2% |
| e | 5756840 | 6.9% |
| s | 5576870 | 6.7% |
| r | 5526161 | 6.6% |
| o | 5207769 | 6.3% |
| l | 4893187 | 5.9% |
| n | 4698467 | 5.7% |
| u | 4559281 | 5.5% |
| t | 3890322 | 4.7% |
| Other values (58) | 23829817 |
Common
| Value | Count | Frequency (%) |
| 5241771 | ||
| . | 675591 | 11.3% |
| - | 21295 | 0.4% |
| ( | 6483 | 0.1% |
| ) | 6483 | 0.1% |
| , | 5059 | 0.1% |
| ' | 4244 | 0.1% |
| & | 2296 | < 0.1% |
| 2 | 2215 | < 0.1% |
| 0 | 1692 | < 0.1% |
| Other values (19) | 4132 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 89092107 | |
| None | 1217 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 10712057 | 12.0% |
| i | 8471292 | 9.5% |
| e | 5756840 | 6.5% |
| s | 5576870 | 6.3% |
| r | 5526161 | 6.2% |
| 5241771 | 5.9% | |
| o | 5207769 | 5.8% |
| l | 4893187 | 5.5% |
| n | 4698467 | 5.3% |
| u | 4559281 | 5.1% |
| Other values (69) | 28448412 |
None
| Value | Count | Frequency (%) |
| ë | 762 | |
| á | 110 | 9.0% |
| ö | 107 | 8.8% |
| ü | 88 | 7.2% |
| Á | 45 | 3.7% |
| é | 39 | 3.2% |
| ó | 26 | 2.1% |
| × | 20 | 1.6% |
| É | 9 | 0.7% |
| Ø | 3 | 0.2% |
| Other values (8) | 8 | 0.7% |
Missing 
| Distinct | 6 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 4515655 |
| Missing (%) | > 99.9% |
| Memory size | 34.5 MiB |
Length
| Max length | 9 |
|---|---|
| Median length | 9 |
| Mean length | 8.5 |
| Min length | 7 |
Unique
| Unique | 6 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | pubiflora |
|---|---|
| 2nd row | simulans |
| 3rd row | sinuosa |
| 4th row | intricata |
| 5th row | nigricans |
| Value | Count | Frequency (%) |
| pubiflora | 1 | |
| simulans | 1 | |
| sinuosa | 1 | |
| intricata | 1 | |
| nigricans | 1 | |
| menziesii | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 10 | |
| a | 6 | |
| s | 6 | |
| n | 6 | |
| r | 3 | 5.9% |
| u | 3 | 5.9% |
| c | 2 | 3.9% |
| e | 2 | 3.9% |
| l | 2 | 3.9% |
| o | 2 | 3.9% |
| Other values (7) | 9 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 51 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 10 | |
| a | 6 | |
| s | 6 | |
| n | 6 | |
| r | 3 | 5.9% |
| u | 3 | 5.9% |
| c | 2 | 3.9% |
| e | 2 | 3.9% |
| l | 2 | 3.9% |
| o | 2 | 3.9% |
| Other values (7) | 9 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 51 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 10 | |
| a | 6 | |
| s | 6 | |
| n | 6 | |
| r | 3 | 5.9% |
| u | 3 | 5.9% |
| c | 2 | 3.9% |
| e | 2 | 3.9% |
| l | 2 | 3.9% |
| o | 2 | 3.9% |
| Other values (7) | 9 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 51 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 10 | |
| a | 6 | |
| s | 6 | |
| n | 6 | |
| r | 3 | 5.9% |
| u | 3 | 5.9% |
| c | 2 | 3.9% |
| e | 2 | 3.9% |
| l | 2 | 3.9% |
| o | 2 | 3.9% |
| Other values (7) | 9 |
parentNameUsage
Text
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 4515659 |
| Missing (%) | > 99.9% |
| Memory size | 34.5 MiB |
Length
| Max length | 9 |
|---|---|
| Median length | 8 |
| Mean length | 8 |
| Min length | 7 |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | pubiflora |
|---|---|
| 2nd row | pinguis |
| Value | Count | Frequency (%) |
| pubiflora | 1 | |
| pinguis | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 3 | |
| p | 2 | |
| u | 2 | |
| b | 1 | 6.2% |
| f | 1 | 6.2% |
| l | 1 | 6.2% |
| o | 1 | 6.2% |
| r | 1 | 6.2% |
| a | 1 | 6.2% |
| n | 1 | 6.2% |
| Other values (2) | 2 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 16 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 3 | |
| p | 2 | |
| u | 2 | |
| b | 1 | 6.2% |
| f | 1 | 6.2% |
| l | 1 | 6.2% |
| o | 1 | 6.2% |
| r | 1 | 6.2% |
| a | 1 | 6.2% |
| n | 1 | 6.2% |
| Other values (2) | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 16 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 3 | |
| p | 2 | |
| u | 2 | |
| b | 1 | 6.2% |
| f | 1 | 6.2% |
| l | 1 | 6.2% |
| o | 1 | 6.2% |
| r | 1 | 6.2% |
| a | 1 | 6.2% |
| n | 1 | 6.2% |
| Other values (2) | 2 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 16 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 3 | |
| p | 2 | |
| u | 2 | |
| b | 1 | 6.2% |
| f | 1 | 6.2% |
| l | 1 | 6.2% |
| o | 1 | 6.2% |
| r | 1 | 6.2% |
| a | 1 | 6.2% |
| n | 1 | 6.2% |
| Other values (2) | 2 |
nameAccordingTo
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 4515660 |
| Missing (%) | > 99.9% |
| Memory size | 34.5 MiB |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 7 |
| Min length | 7 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | variety |
|---|
| Value | Count | Frequency (%) |
| variety | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| v | 1 | |
| a | 1 | |
| r | 1 | |
| i | 1 | |
| e | 1 | |
| t | 1 | |
| y | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 7 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| v | 1 | |
| a | 1 | |
| r | 1 | |
| i | 1 | |
| e | 1 | |
| t | 1 | |
| y | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 7 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| v | 1 | |
| a | 1 | |
| r | 1 | |
| i | 1 | |
| e | 1 | |
| t | 1 | |
| y | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 7 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| v | 1 | |
| a | 1 | |
| r | 1 | |
| i | 1 | |
| e | 1 | |
| t | 1 | |
| y | 1 |
Missing 
| Distinct | 5 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 4515656 |
| Missing (%) | > 99.9% |
| Memory size | 34.5 MiB |
Length
| Max length | 34 |
|---|---|
| Median length | 27 |
| Mean length | 21.6 |
| Min length | 6 |
Unique
| Unique | 5 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | (Benth.) Sprague & Sandwith |
|---|---|
| 2nd row | (Weber Bosse) Pestana et al. |
| 3rd row | (K. Mert. ex Roth) Derbes & Solier |
| 4th row | J.V.Lamouroux |
| 5th row | Decne. |
| Value | Count | Frequency (%) |
| 2 | 11.1% | |
| benth | 1 | 5.6% |
| k | 1 | 5.6% |
| j.v.lamouroux | 1 | 5.6% |
| solier | 1 | 5.6% |
| derbes | 1 | 5.6% |
| roth | 1 | 5.6% |
| ex | 1 | 5.6% |
| mert | 1 | 5.6% |
| al | 1 | 5.6% |
| Other values (7) | 7 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 14 | 13.0% |
| 13 | 12.0% | |
| . | 7 | 6.5% |
| t | 6 | 5.6% |
| r | 6 | 5.6% |
| a | 6 | 5.6% |
| o | 5 | 4.6% |
| n | 4 | 3.7% |
| s | 4 | 3.7% |
| ( | 3 | 2.8% |
| Other values (25) | 40 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 65 | |
| Uppercase Letter | 15 | 13.9% |
| Space Separator | 13 | 12.0% |
| Other Punctuation | 9 | 8.3% |
| Open Punctuation | 3 | 2.8% |
| Close Punctuation | 3 | 2.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 14 | |
| t | 6 | |
| r | 6 | |
| a | 6 | |
| o | 5 | 7.7% |
| n | 4 | 6.2% |
| s | 4 | 6.2% |
| h | 3 | 4.6% |
| u | 3 | 4.6% |
| i | 2 | 3.1% |
| Other values (9) | 12 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 3 | |
| B | 2 | |
| D | 2 | |
| L | 1 | 6.7% |
| J | 1 | 6.7% |
| V | 1 | 6.7% |
| R | 1 | 6.7% |
| M | 1 | 6.7% |
| K | 1 | 6.7% |
| P | 1 | 6.7% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 7 | |
| & | 2 | 22.2% |
Space Separator
| Value | Count | Frequency (%) |
| 13 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 3 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 3 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 80 | |
| Common | 28 | 25.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 14 | |
| t | 6 | 7.5% |
| r | 6 | 7.5% |
| a | 6 | 7.5% |
| o | 5 | 6.2% |
| n | 4 | 5.0% |
| s | 4 | 5.0% |
| h | 3 | 3.8% |
| S | 3 | 3.8% |
| u | 3 | 3.8% |
| Other values (20) | 26 |
Common
| Value | Count | Frequency (%) |
| 13 | ||
| . | 7 | |
| ( | 3 | 10.7% |
| ) | 3 | 10.7% |
| & | 2 | 7.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 108 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 14 | 13.0% |
| 13 | 12.0% | |
| . | 7 | 6.5% |
| t | 6 | 5.6% |
| r | 6 | 5.6% |
| a | 6 | 5.6% |
| o | 5 | 4.6% |
| n | 4 | 3.7% |
| s | 4 | 3.7% |
| ( | 3 | 2.8% |
| Other values (25) | 40 |
| Distinct | 2220 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 13879 |
| Missing (%) | 0.3% |
| Memory size | 34.5 MiB |
Length
| Max length | 135 |
|---|---|
| Median length | 89 |
| Mean length | 55.77126436 |
| Min length | 6 |
Unique
| Unique | 232 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Plantae, Rhodophyta, Corallinales, Lithothamniaceae |
|---|---|
| 2nd row | Plantae, Dicotyledonae, Fabales, Fabaceae, Papilionoideae |
| 3rd row | Plantae, Monocotyledonae, Commelinales, Commelinaceae |
| 4th row | Plantae, Dicotyledonae, Oxalidales, Connaraceae |
| 5th row | Plantae, Monocotyledonae, Poales, Poaceae, Chloridoideae |
| Value | Count | Frequency (%) |
| plantae | 4143724 | 19.6% |
| dicotyledonae | 2583518 | 12.2% |
| monocotyledonae | 909034 | 4.3% |
| poales | 702157 | 3.3% |
| poaceae | 502389 | 2.4% |
| asterales | 380254 | 1.8% |
| asteraceae | 358301 | 1.7% |
| asteroideae | 282993 | 1.3% |
| pteridophyte | 276624 | 1.3% |
| lamiales | 266378 | 1.3% |
| Other values (2232) | 10769135 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 35371989 | |
| e | 35160991 | |
| o | 18857025 | 7.5% |
| 16672725 | 6.6% | |
| , | 16563118 | 6.6% |
| l | 16272824 | 6.5% |
| n | 12770502 | 5.1% |
| t | 12542392 | 5.0% |
| i | 12457371 | 5.0% |
| c | 11339619 | 4.5% |
| Other values (50) | 63061518 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 196578462 | |
| Uppercase Letter | 21064893 | 8.4% |
| Space Separator | 16672725 | 6.6% |
| Other Punctuation | 16583195 | 6.6% |
| Close Punctuation | 85296 | < 0.1% |
| Open Punctuation | 85296 | < 0.1% |
| Dash Punctuation | 207 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 35371989 | |
| e | 35160991 | |
| o | 18857025 | |
| l | 16272824 | |
| n | 12770502 | 6.5% |
| t | 12542392 | 6.4% |
| i | 12457371 | 6.3% |
| c | 11339619 | 5.8% |
| s | 7950561 | 4.0% |
| d | 7948035 | 4.0% |
| Other values (17) | 25907153 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 7116979 | |
| D | 2790296 | 13.2% |
| A | 1980335 | 9.4% |
| M | 1829877 | 8.7% |
| C | 1526896 | 7.2% |
| L | 903009 | 4.3% |
| F | 880493 | 4.2% |
| R | 765320 | 3.6% |
| B | 726846 | 3.5% |
| S | 642548 | 3.1% |
| Other values (16) | 1902294 | 9.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 16563118 | |
| . | 20069 | 0.1% |
| ? | 8 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 16672725 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 85296 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 85296 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 207 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 217643355 | |
| Common | 33426719 | 13.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 35371989 | |
| e | 35160991 | |
| o | 18857025 | |
| l | 16272824 | 7.5% |
| n | 12770502 | 5.9% |
| t | 12542392 | 5.8% |
| i | 12457371 | 5.7% |
| c | 11339619 | 5.2% |
| s | 7950561 | 3.7% |
| d | 7948035 | 3.7% |
| Other values (43) | 46972046 |
Common
| Value | Count | Frequency (%) |
| 16672725 | ||
| , | 16563118 | |
| ) | 85296 | 0.3% |
| ( | 85296 | 0.3% |
| . | 20069 | 0.1% |
| - | 207 | < 0.1% |
| ? | 8 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 251069452 | |
| None | 622 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 35371989 | |
| e | 35160991 | |
| o | 18857025 | 7.5% |
| 16672725 | 6.6% | |
| , | 16563118 | 6.6% |
| l | 16272824 | 6.5% |
| n | 12770502 | 5.1% |
| t | 12542392 | 5.0% |
| i | 12457371 | 5.0% |
| c | 11339619 | 4.5% |
| Other values (49) | 63060896 |
None
| Value | Count | Frequency (%) |
| ö | 622 |
kingdom
Text
| Distinct | 13 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 15585 |
| Missing (%) | 0.3% |
| Memory size | 34.5 MiB |
Length
| Max length | 33 |
|---|---|
| Median length | 7 |
| Mean length | 6.96302618 |
| Min length | 5 |
Unique
| Unique | 4 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Plantae |
|---|---|
| 2nd row | Plantae |
| 3rd row | Plantae |
| 4th row | Plantae |
| 5th row | Plantae |
| Value | Count | Frequency (%) |
| plantae | 4143685 | |
| fungi | 223269 | 5.0% |
| eubacteria | 52558 | 1.2% |
| chromista | 41845 | 0.9% |
| protista | 38689 | 0.9% |
| protozoa | 24 | < 0.1% |
| incertae | 3 | < 0.1% |
| sedis | 3 | < 0.1% |
| prokaryota | 2 | < 0.1% |
| kingdom | 2 | < 0.1% |
| Other values (4) | 4 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 8473056 | |
| n | 4366962 | |
| t | 4315498 | |
| e | 4196255 | |
| P | 4182397 | |
| l | 4143685 | |
| i | 356370 | 1.1% |
| u | 275829 | 0.9% |
| g | 223272 | 0.7% |
| F | 223267 | 0.7% |
| Other values (18) | 577556 | 1.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 26834068 | |
| Uppercase Letter | 4500071 | 14.4% |
| Space Separator | 8 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 8473056 | |
| n | 4366962 | |
| t | 4315498 | |
| e | 4196255 | |
| l | 4143685 | |
| i | 356370 | 1.3% |
| u | 275829 | 1.0% |
| g | 223272 | 0.8% |
| r | 133125 | 0.5% |
| o | 80614 | 0.3% |
| Other values (11) | 269402 | 1.0% |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 4182397 | |
| F | 223267 | 5.0% |
| E | 52559 | 1.2% |
| C | 41845 | 0.9% |
| I | 2 | < 0.1% |
| B | 1 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 8 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 31334139 | |
| Common | 8 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 8473056 | |
| n | 4366962 | |
| t | 4315498 | |
| e | 4196255 | |
| P | 4182397 | |
| l | 4143685 | |
| i | 356370 | 1.1% |
| u | 275829 | 0.9% |
| g | 223272 | 0.7% |
| F | 223267 | 0.7% |
| Other values (17) | 577548 | 1.8% |
Common
| Value | Count | Frequency (%) |
| 8 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 31334147 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 8473056 | |
| n | 4366962 | |
| t | 4315498 | |
| e | 4196255 | |
| P | 4182397 | |
| l | 4143685 | |
| i | 356370 | 1.1% |
| u | 275829 | 0.9% |
| g | 223272 | 0.7% |
| F | 223267 | 0.7% |
| Other values (18) | 577556 | 1.8% |
phylum
Text
Missing 
| Distinct | 37 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 3795307 |
| Missing (%) | 84.0% |
| Memory size | 34.5 MiB |
Length
| Max length | 27 |
|---|---|
| Median length | 10 |
| Mean length | 10.45428498 |
| Min length | 6 |
Unique
| Unique | 3 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Rhodophyta |
|---|---|
| 2nd row | Bryophyta |
| 3rd row | Ascomycota |
| 4th row | Rhodophyta |
| 5th row | Bryophyta |
| Value | Count | Frequency (%) |
| ascomycota | 220381 | |
| bryophyta | 148369 | |
| rhodophyta | 121618 | |
| cyanobacteria | 52553 | 7.3% |
| chlorophyta | 44629 | 6.2% |
| bacillariophyta | 33225 | 4.6% |
| ochrophyta | 29634 | 4.1% |
| marchantiophyta | 26552 | 3.7% |
| pinophyta | 21234 | 2.9% |
| miozoa | 4849 | 0.7% |
| Other values (29) | 18257 | 2.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 1114609 | |
| a | 950427 | |
| y | 866328 | |
| t | 747357 | |
| h | 664707 | |
| c | 586886 | |
| p | 438109 | 5.8% |
| r | 348688 | 4.6% |
| s | 224255 | 3.0% |
| m | 222302 | 3.0% |
| Other values (26) | 1367118 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 6807890 | |
| Uppercase Letter | 720354 | 9.6% |
| Other Punctuation | 1595 | < 0.1% |
| Space Separator | 947 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 1114609 | |
| a | 950427 | |
| y | 866328 | |
| t | 747357 | |
| h | 664707 | |
| c | 586886 | |
| p | 438109 | 6.4% |
| r | 348688 | 5.1% |
| s | 224255 | 3.3% |
| m | 222302 | 3.3% |
| Other values (11) | 644222 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 220877 | |
| B | 183494 | |
| R | 121618 | |
| C | 101098 | |
| M | 32077 | 4.5% |
| O | 29634 | 4.1% |
| P | 25420 | 3.5% |
| I | 2540 | 0.4% |
| G | 2063 | 0.3% |
| H | 914 | 0.1% |
| Other values (3) | 619 | 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1595 |
Space Separator
| Value | Count | Frequency (%) |
| 947 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 7528244 | |
| Common | 2542 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 1114609 | |
| a | 950427 | |
| y | 866328 | |
| t | 747357 | |
| h | 664707 | |
| c | 586886 | |
| p | 438109 | 5.8% |
| r | 348688 | 4.6% |
| s | 224255 | 3.0% |
| m | 222302 | 3.0% |
| Other values (24) | 1364576 |
Common
| Value | Count | Frequency (%) |
| . | 1595 | |
| 947 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 7530786 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 1114609 | |
| a | 950427 | |
| y | 866328 | |
| t | 747357 | |
| h | 664707 | |
| c | 586886 | |
| p | 438109 | 5.8% |
| r | 348688 | 4.6% |
| s | 224255 | 3.0% |
| m | 222302 | 3.0% |
| Other values (26) | 1367118 |
class
Text
Missing 
| Distinct | 88 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 166450 |
| Missing (%) | 3.7% |
| Memory size | 34.5 MiB |
Length
| Max length | 33 |
|---|---|
| Median length | 13 |
| Mean length | 13.51310502 |
| Min length | 6 |
Unique
| Unique | 14 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Dicotyledonae |
|---|---|
| 2nd row | Monocotyledonae |
| 3rd row | Dicotyledonae |
| 4th row | Monocotyledonae |
| 5th row | Dicotyledonae |
| Value | Count | Frequency (%) |
| dicotyledonae | 2583517 | |
| monocotyledonae | 909034 | 20.5% |
| pteridophyte | 276624 | 6.2% |
| lecanoromycetes | 202991 | 4.6% |
| bryopsida | 127668 | 2.9% |
| florideophyceae | 89454 | 2.0% |
| basal | 85188 | 1.9% |
| ulvophyceae | 35522 | 0.8% |
| jungermanniopsida | 25865 | 0.6% |
| pinopsida | 23209 | 0.5% |
| Other values (80) | 76533 | 1.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 9980172 | |
| e | 8651381 | |
| n | 4733378 | |
| t | 4293923 | |
| y | 4289084 | |
| a | 4288463 | |
| c | 4090012 | |
| d | 4070361 | |
| l | 3713365 | 6.3% |
| i | 3238121 | 5.5% |
| Other values (37) | 7423085 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 54165150 | |
| Uppercase Letter | 4349208 | 7.4% |
| Space Separator | 86394 | 0.1% |
| Open Punctuation | 85188 | 0.1% |
| Close Punctuation | 85188 | 0.1% |
| Other Punctuation | 217 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 9980172 | |
| e | 8651381 | |
| n | 4733378 | |
| t | 4293923 | |
| y | 4289084 | |
| a | 4288463 | |
| c | 4090012 | |
| d | 4070361 | |
| l | 3713365 | 6.9% |
| i | 3238121 | 6.0% |
| Other values (13) | 2816890 | 5.2% |
Uppercase Letter
| Value | Count | Frequency (%) |
| D | 2591725 | |
| M | 910509 | 20.9% |
| P | 327909 | 7.5% |
| L | 204284 | 4.7% |
| B | 135509 | 3.1% |
| F | 89454 | 2.1% |
| U | 35524 | 0.8% |
| J | 25865 | 0.6% |
| A | 8728 | 0.2% |
| S | 7620 | 0.2% |
| Other values (10) | 12081 | 0.3% |
Space Separator
| Value | Count | Frequency (%) |
| 86394 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 85188 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 85188 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 217 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 58514358 | |
| Common | 256987 | 0.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 9980172 | |
| e | 8651381 | |
| n | 4733378 | |
| t | 4293923 | |
| y | 4289084 | |
| a | 4288463 | |
| c | 4090012 | |
| d | 4070361 | |
| l | 3713365 | 6.3% |
| i | 3238121 | 5.5% |
| Other values (33) | 7166098 |
Common
| Value | Count | Frequency (%) |
| 86394 | ||
| ( | 85188 | |
| ) | 85188 | |
| . | 217 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 58771345 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 9980172 | |
| e | 8651381 | |
| n | 4733378 | |
| t | 4293923 | |
| y | 4289084 | |
| a | 4288463 | |
| c | 4090012 | |
| d | 4070361 | |
| l | 3713365 | 6.3% |
| i | 3238121 | 5.5% |
| Other values (37) | 7423085 |
order
Text
Missing 
| Distinct | 404 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 53019 |
| Missing (%) | 1.2% |
| Memory size | 34.5 MiB |
Length
| Max length | 36 |
|---|---|
| Median length | 31 |
| Mean length | 9.299688615 |
| Min length | 6 |
Unique
| Unique | 39 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Corallinales |
|---|---|
| 2nd row | Fabales |
| 3rd row | Commelinales |
| 4th row | Oxalidales |
| 5th row | Poales |
| Value | Count | Frequency (%) |
| poales | 702157 | 15.7% |
| asterales | 380253 | 8.5% |
| lamiales | 266378 | 6.0% |
| fabales | 254592 | 5.7% |
| malpighiales | 211605 | 4.7% |
| polypodiales | 193095 | 4.3% |
| gentianales | 180579 | 4.0% |
| myrtales | 158308 | 3.5% |
| caryophyllales | 147659 | 3.3% |
| ericales | 129764 | 2.9% |
| Other values (398) | 1839979 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 6796977 | |
| l | 5858220 | |
| e | 5561572 | |
| s | 5364064 | |
| i | 2260328 | 5.4% |
| o | 2043726 | 4.9% |
| r | 1726800 | 4.2% |
| n | 1206408 | 2.9% |
| t | 1060667 | 2.6% |
| P | 1016591 | 2.4% |
| Other values (41) | 8605828 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 37031041 | |
| Uppercase Letter | 4462642 | 10.8% |
| Other Punctuation | 5771 | < 0.1% |
| Space Separator | 1727 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 6796977 | |
| l | 5858220 | |
| e | 5561572 | |
| s | 5364064 | |
| i | 2260328 | 6.1% |
| o | 2043726 | 5.5% |
| r | 1726800 | 4.7% |
| n | 1206408 | 3.3% |
| t | 1060667 | 2.9% |
| p | 1000669 | 2.7% |
| Other values (15) | 4151610 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 1016591 | |
| A | 581960 | |
| M | 481360 | |
| L | 454975 | |
| C | 361177 | 8.1% |
| F | 296457 | 6.6% |
| S | 246396 | 5.5% |
| G | 227690 | 5.1% |
| R | 187767 | 4.2% |
| E | 144300 | 3.2% |
| Other values (14) | 463969 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 5771 |
Space Separator
| Value | Count | Frequency (%) |
| 1727 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 41493683 | |
| Common | 7498 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 6796977 | |
| l | 5858220 | |
| e | 5561572 | |
| s | 5364064 | |
| i | 2260328 | 5.4% |
| o | 2043726 | 4.9% |
| r | 1726800 | 4.2% |
| n | 1206408 | 2.9% |
| t | 1060667 | 2.6% |
| P | 1016591 | 2.4% |
| Other values (39) | 8598330 |
Common
| Value | Count | Frequency (%) |
| . | 5771 | |
| 1727 | 23.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 41501181 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 6796977 | |
| l | 5858220 | |
| e | 5561572 | |
| s | 5364064 | |
| i | 2260328 | 5.4% |
| o | 2043726 | 4.9% |
| r | 1726800 | 4.2% |
| n | 1206408 | 2.9% |
| t | 1060667 | 2.6% |
| P | 1016591 | 2.4% |
| Other values (41) | 8605828 |
family
Text
Missing 
| Distinct | 1349 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 49040 |
| Missing (%) | 1.1% |
| Memory size | 34.5 MiB |
Length
| Max length | 38 |
|---|---|
| Median length | 34 |
| Mean length | 10.77015601 |
| Min length | 6 |
Unique
| Unique | 107 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Lithothamniaceae |
|---|---|
| 2nd row | Fabaceae |
| 3rd row | Commelinaceae |
| 4th row | Connaraceae |
| 5th row | Poaceae |
| Value | Count | Frequency (%) |
| poaceae | 502389 | 11.2% |
| asteraceae | 358301 | 8.0% |
| fabaceae | 237916 | 5.3% |
| cyperaceae | 139791 | 3.1% |
| rubiaceae | 119885 | 2.7% |
| melastomataceae | 73747 | 1.6% |
| parmeliaceae | 66931 | 1.5% |
| rosaceae | 65723 | 1.5% |
| lamiaceae | 62361 | 1.4% |
| euphorbiaceae | 59298 | 1.3% |
| Other values (1336) | 2800225 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 10978354 | |
| e | 10624096 | |
| c | 5308142 | |
| i | 2196406 | 4.6% |
| r | 2091808 | 4.3% |
| o | 2020260 | 4.2% |
| l | 1542594 | 3.2% |
| t | 1385215 | 2.9% |
| n | 1300651 | 2.7% |
| s | 1002425 | 2.1% |
| Other values (47) | 9656254 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 43607763 | |
| Uppercase Letter | 4466621 | 9.3% |
| Space Separator | 19946 | < 0.1% |
| Other Punctuation | 11687 | < 0.1% |
| Open Punctuation | 94 | < 0.1% |
| Close Punctuation | 94 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 10978354 | |
| e | 10624096 | |
| c | 5308142 | |
| i | 2196406 | 5.0% |
| r | 2091808 | 4.8% |
| o | 2020260 | 4.6% |
| l | 1542594 | 3.5% |
| t | 1385215 | 3.2% |
| n | 1300651 | 3.0% |
| s | 1002425 | 2.3% |
| Other values (16) | 5157812 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 961089 | |
| A | 673867 | |
| C | 509106 | |
| R | 296480 | 6.6% |
| F | 267712 | 6.0% |
| S | 249020 | 5.6% |
| M | 248845 | 5.6% |
| L | 198706 | 4.4% |
| B | 172370 | 3.9% |
| O | 153205 | 3.4% |
| Other values (16) | 736221 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 11679 | |
| ? | 8 | 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 19946 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 94 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 94 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 48074384 | |
| Common | 31821 | 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 10978354 | |
| e | 10624096 | |
| c | 5308142 | |
| i | 2196406 | 4.6% |
| r | 2091808 | 4.4% |
| o | 2020260 | 4.2% |
| l | 1542594 | 3.2% |
| t | 1385215 | 2.9% |
| n | 1300651 | 2.7% |
| s | 1002425 | 2.1% |
| Other values (42) | 9624433 |
Common
| Value | Count | Frequency (%) |
| 19946 | ||
| . | 11679 | |
| ( | 94 | 0.3% |
| ) | 94 | 0.3% |
| ? | 8 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 48106205 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 10978354 | |
| e | 10624096 | |
| c | 5308142 | |
| i | 2196406 | 4.6% |
| r | 2091808 | 4.3% |
| o | 2020260 | 4.2% |
| l | 1542594 | 3.2% |
| t | 1385215 | 2.9% |
| n | 1300651 | 2.7% |
| s | 1002425 | 2.1% |
| Other values (47) | 9656254 |
genus
Text
| Distinct | 19533 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 13790 |
| Missing (%) | 0.3% |
| Memory size | 34.5 MiB |
Length
| Max length | 28 |
|---|---|
| Median length | 21 |
| Mean length | 8.779886629 |
| Min length | 2 |
Unique
| Unique | 2723 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | Lithothamnion |
|---|---|
| 2nd row | Amicia |
| 3rd row | Tripogandra |
| 4th row | Connarus |
| 5th row | Trichoneura |
| Value | Count | Frequency (%) |
| carex | 58129 | 1.3% |
| indet | 34179 | 0.8% |
| poa | 30191 | 0.7% |
| cyperus | 27842 | 0.6% |
| cladonia | 26929 | 0.6% |
| paspalum | 26142 | 0.6% |
| solanum | 24852 | 0.6% |
| miconia | 24577 | 0.5% |
| eragrostis | 23701 | 0.5% |
| asplenium | 20032 | 0.4% |
| Other values (19524) | 4220155 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 4832994 | 12.2% |
| i | 3638319 | 9.2% |
| o | 2765978 | 7.0% |
| e | 2764006 | 7.0% |
| r | 2582882 | 6.5% |
| l | 2161942 | 5.5% |
| n | 2070715 | 5.2% |
| s | 2058198 | 5.2% |
| u | 1993081 | 5.0% |
| t | 1685888 | 4.3% |
| Other values (49) | 12971914 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 34974819 | |
| Uppercase Letter | 4501774 | 11.4% |
| Other Punctuation | 34178 | 0.1% |
| Space Separator | 14858 | < 0.1% |
| Open Punctuation | 97 | < 0.1% |
| Close Punctuation | 97 | < 0.1% |
| Dash Punctuation | 94 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 4832994 | |
| i | 3638319 | |
| o | 2765978 | 7.9% |
| e | 2764006 | 7.9% |
| r | 2582882 | 7.4% |
| l | 2161942 | 6.2% |
| n | 2070715 | 5.9% |
| s | 2058198 | 5.9% |
| u | 1993081 | 5.7% |
| t | 1685888 | 4.8% |
| Other values (17) | 8420816 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 641047 | |
| P | 589254 | |
| S | 440682 | |
| A | 403961 | 9.0% |
| M | 286195 | 6.4% |
| E | 238456 | 5.3% |
| L | 237999 | 5.3% |
| D | 211365 | 4.7% |
| B | 192319 | 4.3% |
| H | 182246 | 4.0% |
| Other values (16) | 1078250 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 34174 | |
| / | 4 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 14858 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 97 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 97 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 94 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 39476593 | |
| Common | 49324 | 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 4832994 | 12.2% |
| i | 3638319 | 9.2% |
| o | 2765978 | 7.0% |
| e | 2764006 | 7.0% |
| r | 2582882 | 6.5% |
| l | 2161942 | 5.5% |
| n | 2070715 | 5.2% |
| s | 2058198 | 5.2% |
| u | 1993081 | 5.0% |
| t | 1685888 | 4.3% |
| Other values (43) | 12922590 |
Common
| Value | Count | Frequency (%) |
| . | 34174 | |
| 14858 | ||
| ( | 97 | 0.2% |
| ) | 97 | 0.2% |
| - | 94 | 0.2% |
| / | 4 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 39525225 | |
| None | 692 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 4832994 | 12.2% |
| i | 3638319 | 9.2% |
| o | 2765978 | 7.0% |
| e | 2764006 | 7.0% |
| r | 2582882 | 6.5% |
| l | 2161942 | 5.5% |
| n | 2070715 | 5.2% |
| s | 2058198 | 5.2% |
| u | 1993081 | 5.0% |
| t | 1685888 | 4.3% |
| Other values (48) | 12971222 |
None
| Value | Count | Frequency (%) |
| ë | 692 |
subgenus
Text
Missing 
| Distinct | 14 |
|---|---|
| Distinct (%) | 15.7% |
| Missing | 4515572 |
| Missing (%) | > 99.9% |
| Memory size | 34.5 MiB |
Length
| Max length | 18 |
|---|---|
| Median length | 17 |
| Mean length | 12.14606742 |
| Min length | 6 |
Unique
| Unique | 5 ? |
|---|---|
| Unique (%) | 5.6% |
Sample
| 1st row | Choanopsis |
|---|---|
| 2nd row | Leptostemonum |
| 3rd row | Leptostemonum |
| 4th row | Pseudopoa |
| 5th row | Leptostemonum |
| Value | Count | Frequency (%) |
| leptostemonum | 41 | |
| meniscium | 13 | 14.4% |
| goniophlebiopteris | 10 | 11.1% |
| pseudopoa | 6 | 6.7% |
| choanopsis | 5 | 5.6% |
| penzigia | 3 | 3.3% |
| arenariae | 2 | 2.2% |
| trichochloa | 2 | 2.2% |
| pseudolysimachium | 2 | 2.2% |
| limnochloa | 1 | 1.1% |
| Other values (5) | 5 | 5.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 147 | |
| e | 131 | |
| m | 100 | |
| t | 93 | |
| s | 84 | |
| i | 82 | |
| n | 78 | |
| p | 74 | |
| u | 66 | 6.1% |
| L | 42 | 3.9% |
| Other values (22) | 184 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 991 | |
| Uppercase Letter | 88 | 8.1% |
| Other Punctuation | 1 | 0.1% |
| Space Separator | 1 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 147 | |
| e | 131 | |
| m | 100 | |
| t | 93 | |
| s | 84 | |
| i | 82 | |
| n | 78 | |
| p | 74 | |
| u | 66 | |
| a | 28 | 2.8% |
| Other values (11) | 108 |
Uppercase Letter
| Value | Count | Frequency (%) |
| L | 42 | |
| M | 13 | 14.8% |
| P | 11 | 12.5% |
| G | 10 | 11.4% |
| C | 6 | 6.8% |
| A | 2 | 2.3% |
| T | 2 | 2.3% |
| D | 1 | 1.1% |
| F | 1 | 1.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| § | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1079 | |
| Common | 2 | 0.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 147 | |
| e | 131 | |
| m | 100 | |
| t | 93 | |
| s | 84 | |
| i | 82 | |
| n | 78 | |
| p | 74 | |
| u | 66 | 6.1% |
| L | 42 | 3.9% |
| Other values (20) | 182 |
Common
| Value | Count | Frequency (%) |
| § | 1 | |
| 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1080 | |
| None | 1 | 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 147 | |
| e | 131 | |
| m | 100 | |
| t | 93 | |
| s | 84 | |
| i | 82 | |
| n | 78 | |
| p | 74 | |
| u | 66 | 6.1% |
| L | 42 | 3.9% |
| Other values (21) | 183 |
None
| Value | Count | Frequency (%) |
| § | 1 |
specificEpithet
Text
| Distinct | 75699 |
|---|---|
| Distinct (%) | 1.7% |
| Missing | 19440 |
| Missing (%) | 0.4% |
| Memory size | 34.5 MiB |
Length
| Max length | 32 |
|---|---|
| Median length | 27 |
| Mean length | 8.783425904 |
| Min length | 2 |
Unique
| Unique | 20935 ? |
|---|---|
| Unique (%) | 0.5% |
Sample
| 1st row | calcareum |
|---|---|
| 2nd row | glandulosa |
| 3rd row | glandulosa |
| 4th row | steyermarkii |
| 5th row | grandiglumis |
| Value | Count | Frequency (%) |
| sp | 270345 | 6.0% |
| canadensis | 11707 | 0.3% |
| guianensis | 11466 | 0.3% |
| americana | 11279 | 0.3% |
| latifolia | 11149 | 0.2% |
| repens | 10154 | 0.2% |
| parviflora | 10009 | 0.2% |
| occidentalis | 9667 | 0.2% |
| gracilis | 9142 | 0.2% |
| indica | 9037 | 0.2% |
| Other values (75609) | 4133864 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 5237374 | |
| i | 4468913 | |
| s | 3076318 | 7.8% |
| e | 2756650 | 7.0% |
| r | 2532177 | 6.4% |
| l | 2515617 | 6.4% |
| n | 2415168 | 6.1% |
| u | 2268906 | 5.7% |
| o | 2256284 | 5.7% |
| t | 2041234 | 5.2% |
| Other values (45) | 9923583 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 39191766 | |
| Other Punctuation | 278333 | 0.7% |
| Dash Punctuation | 20172 | 0.1% |
| Space Separator | 1598 | < 0.1% |
| Decimal Number | 288 | < 0.1% |
| Open Punctuation | 30 | < 0.1% |
| Close Punctuation | 30 | < 0.1% |
| Math Symbol | 7 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 5237374 | |
| i | 4468913 | |
| s | 3076318 | 7.8% |
| e | 2756650 | 7.0% |
| r | 2532177 | 6.5% |
| l | 2515617 | 6.4% |
| n | 2415168 | 6.2% |
| u | 2268906 | 5.8% |
| o | 2256284 | 5.8% |
| t | 2041234 | 5.2% |
| Other values (19) | 9623125 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 93 | |
| 2 | 48 | |
| 0 | 41 | |
| 8 | 39 | |
| 6 | 25 | 8.7% |
| 3 | 18 | 6.2% |
| 5 | 9 | 3.1% |
| 4 | 9 | 3.1% |
| 7 | 5 | 1.7% |
| 9 | 1 | 0.3% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 278069 | |
| " | 114 | < 0.1% |
| ' | 90 | < 0.1% |
| / | 33 | < 0.1% |
| ? | 14 | < 0.1% |
| # | 11 | < 0.1% |
| , | 1 | < 0.1% |
| * | 1 | < 0.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 28 | |
| [ | 2 | 6.7% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 28 | |
| ] | 2 | 6.7% |
Math Symbol
| Value | Count | Frequency (%) |
| × | 4 | |
| ~ | 3 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 20172 |
Space Separator
| Value | Count | Frequency (%) |
| 1598 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 39191766 | |
| Common | 300458 | 0.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 5237374 | |
| i | 4468913 | |
| s | 3076318 | 7.8% |
| e | 2756650 | 7.0% |
| r | 2532177 | 6.5% |
| l | 2515617 | 6.4% |
| n | 2415168 | 6.2% |
| u | 2268906 | 5.8% |
| o | 2256284 | 5.8% |
| t | 2041234 | 5.2% |
| Other values (19) | 9623125 |
Common
| Value | Count | Frequency (%) |
| . | 278069 | |
| - | 20172 | 6.7% |
| 1598 | 0.5% | |
| " | 114 | < 0.1% |
| 1 | 93 | < 0.1% |
| ' | 90 | < 0.1% |
| 2 | 48 | < 0.1% |
| 0 | 41 | < 0.1% |
| 8 | 39 | < 0.1% |
| / | 33 | < 0.1% |
| Other values (16) | 161 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 39492142 | |
| None | 82 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 5237374 | |
| i | 4468913 | |
| s | 3076318 | 7.8% |
| e | 2756650 | 7.0% |
| r | 2532177 | 6.4% |
| l | 2515617 | 6.4% |
| n | 2415168 | 6.1% |
| u | 2268906 | 5.7% |
| o | 2256284 | 5.7% |
| t | 2041234 | 5.2% |
| Other values (41) | 9923501 |
None
| Value | Count | Frequency (%) |
| ë | 68 | |
| ü | 9 | 11.0% |
| × | 4 | 4.9% |
| ñ | 1 | 1.2% |
Missing 
| Distinct | 13508 |
|---|---|
| Distinct (%) | 4.2% |
| Missing | 4196068 |
| Missing (%) | 92.9% |
| Memory size | 34.5 MiB |
Length
| Max length | 33 |
|---|---|
| Median length | 29 |
| Mean length | 9.193180076 |
| Min length | 1 |
Unique
| Unique | 4773 ? |
|---|---|
| Unique (%) | 1.5% |
Sample
| 1st row | oxyphylla |
|---|---|
| 2nd row | subalpinum |
| 3rd row | purpurescens |
| 4th row | pubescens |
| 5th row | hirsuta |
| Value | Count | Frequency (%) |
| acuminatum | 4368 | 1.4% |
| pubescens | 1879 | 0.6% |
| secunda | 1646 | 0.5% |
| dichotomum | 1521 | 0.5% |
| americana | 1487 | 0.5% |
| gracilis | 1466 | 0.5% |
| angustifolia | 1339 | 0.4% |
| typica | 1218 | 0.4% |
| occidentalis | 1214 | 0.4% |
| glauca | 1198 | 0.4% |
| Other values (13459) | 302708 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 401235 | |
| i | 336149 | |
| s | 216184 | 7.4% |
| e | 204762 | 7.0% |
| l | 195545 | 6.7% |
| n | 184185 | 6.3% |
| r | 181755 | 6.2% |
| u | 176685 | 6.0% |
| o | 167405 | 5.7% |
| t | 150042 | 5.1% |
| Other values (38) | 724129 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2936297 | |
| Dash Punctuation | 833 | < 0.1% |
| Space Separator | 451 | < 0.1% |
| Other Punctuation | 290 | < 0.1% |
| Uppercase Letter | 90 | < 0.1% |
| Open Punctuation | 55 | < 0.1% |
| Close Punctuation | 55 | < 0.1% |
| Math Symbol | 5 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 401235 | |
| i | 336149 | |
| s | 216184 | 7.4% |
| e | 204762 | 7.0% |
| l | 195545 | 6.7% |
| n | 184185 | 6.3% |
| r | 181755 | 6.2% |
| u | 176685 | 6.0% |
| o | 167405 | 5.7% |
| t | 150042 | 5.1% |
| Other values (18) | 722350 |
Uppercase Letter
| Value | Count | Frequency (%) |
| I | 23 | |
| F | 22 | |
| C | 14 | |
| A | 11 | |
| B | 7 | 7.8% |
| O | 4 | 4.4% |
| H | 4 | 4.4% |
| M | 2 | 2.2% |
| D | 1 | 1.1% |
| V | 1 | 1.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 234 | |
| ' | 25 | 8.6% |
| ? | 19 | 6.6% |
| " | 12 | 4.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 833 |
Space Separator
| Value | Count | Frequency (%) |
| 451 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 55 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 55 |
Math Symbol
| Value | Count | Frequency (%) |
| × | 5 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2936387 | |
| Common | 1689 | 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 401235 | |
| i | 336149 | |
| s | 216184 | 7.4% |
| e | 204762 | 7.0% |
| l | 195545 | 6.7% |
| n | 184185 | 6.3% |
| r | 181755 | 6.2% |
| u | 176685 | 6.0% |
| o | 167405 | 5.7% |
| t | 150042 | 5.1% |
| Other values (29) | 722440 |
Common
| Value | Count | Frequency (%) |
| - | 833 | |
| 451 | ||
| . | 234 | 13.9% |
| ( | 55 | 3.3% |
| ) | 55 | 3.3% |
| ' | 25 | 1.5% |
| ? | 19 | 1.1% |
| " | 12 | 0.7% |
| × | 5 | 0.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2938068 | |
| None | 8 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 401235 | |
| i | 336149 | |
| s | 216184 | 7.4% |
| e | 204762 | 7.0% |
| l | 195545 | 6.7% |
| n | 184185 | 6.3% |
| r | 181755 | 6.2% |
| u | 176685 | 6.0% |
| o | 167405 | 5.7% |
| t | 150042 | 5.1% |
| Other values (35) | 724121 |
None
| Value | Count | Frequency (%) |
| × | 5 | |
| ë | 2 | 25.0% |
| ß | 1 | 12.5% |
cultivarEpithet
Text
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 4515659 |
| Missing (%) | > 99.9% |
| Memory size | 34.5 MiB |
Length
| Max length | 16 |
|---|---|
| Median length | 14.5 |
| Mean length | 14.5 |
| Min length | 13 |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Melica torreyana |
|---|---|
| 2nd row | Diplazium sp. |
| Value | Count | Frequency (%) |
| melica | 1 | |
| torreyana | 1 | |
| diplazium | 1 | |
| sp | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 4 | |
| i | 3 | 10.3% |
| r | 2 | 6.9% |
| l | 2 | 6.9% |
| 2 | 6.9% | |
| e | 2 | 6.9% |
| p | 2 | 6.9% |
| D | 1 | 3.4% |
| s | 1 | 3.4% |
| m | 1 | 3.4% |
| Other values (9) | 9 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 24 | |
| Space Separator | 2 | 6.9% |
| Uppercase Letter | 2 | 6.9% |
| Other Punctuation | 1 | 3.4% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 4 | |
| i | 3 | |
| r | 2 | 8.3% |
| l | 2 | 8.3% |
| e | 2 | 8.3% |
| p | 2 | 8.3% |
| s | 1 | 4.2% |
| m | 1 | 4.2% |
| u | 1 | 4.2% |
| z | 1 | 4.2% |
| Other values (5) | 5 |
Uppercase Letter
| Value | Count | Frequency (%) |
| D | 1 | |
| M | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 2 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 26 | |
| Common | 3 | 10.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 4 | |
| i | 3 | |
| r | 2 | 7.7% |
| l | 2 | 7.7% |
| e | 2 | 7.7% |
| p | 2 | 7.7% |
| D | 1 | 3.8% |
| s | 1 | 3.8% |
| m | 1 | 3.8% |
| u | 1 | 3.8% |
| Other values (7) | 7 |
Common
| Value | Count | Frequency (%) |
| 2 | ||
| . | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 29 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 4 | |
| i | 3 | 10.3% |
| r | 2 | 6.9% |
| l | 2 | 6.9% |
| 2 | 6.9% | |
| e | 2 | 6.9% |
| p | 2 | 6.9% |
| D | 1 | 3.4% |
| s | 1 | 3.4% |
| m | 1 | 3.4% |
| Other values (9) | 9 |
taxonRank
Text
Missing 
| Distinct | 28 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 4196350 |
| Missing (%) | 92.9% |
| Memory size | 34.5 MiB |
Length
| Max length | 11 |
|---|---|
| Median length | 7 |
| Mean length | 7.875528873 |
| Min length | 2 |
Unique
| Unique | 4 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | variety |
|---|---|
| 2nd row | Variety |
| 3rd row | variety |
| 4th row | subspecies |
| 5th row | Variety |
| Value | Count | Frequency (%) |
| variety | 207363 | |
| subspecies | 101047 | |
| forma | 8236 | 2.6% |
| var | 2270 | 0.7% |
| form | 85 | < 0.1% |
| subvariety | 81 | < 0.1% |
| aff | 73 | < 0.1% |
| nothosubsp | 57 | < 0.1% |
| agg | 18 | < 0.1% |
| fo | 18 | < 0.1% |
| Other values (15) | 63 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 409561 | |
| i | 308499 | |
| s | 303355 | |
| a | 218079 | |
| r | 218075 | |
| t | 207516 | |
| y | 207450 | |
| v | 181926 | |
| u | 101201 | 4.0% |
| b | 101194 | 4.0% |
| Other values (16) | 257887 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2484340 | |
| Uppercase Letter | 28022 | 1.1% |
| Other Punctuation | 2367 | 0.1% |
| Open Punctuation | 7 | < 0.1% |
| Close Punctuation | 7 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 409561 | |
| i | 308499 | |
| s | 303355 | |
| a | 218079 | |
| r | 218075 | |
| t | 207516 | |
| y | 207450 | |
| v | 181926 | |
| u | 101201 | 4.1% |
| b | 101194 | 4.1% |
| Other values (11) | 227484 |
Uppercase Letter
| Value | Count | Frequency (%) |
| V | 27814 | |
| F | 208 | 0.7% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 2367 |
Open Punctuation
| Value | Count | Frequency (%) |
| [ | 7 |
Close Punctuation
| Value | Count | Frequency (%) |
| ] | 7 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2512362 | |
| Common | 2381 | 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 409561 | |
| i | 308499 | |
| s | 303355 | |
| a | 218079 | |
| r | 218075 | |
| t | 207516 | |
| y | 207450 | |
| v | 181926 | |
| u | 101201 | 4.0% |
| b | 101194 | 4.0% |
| Other values (13) | 255506 |
Common
| Value | Count | Frequency (%) |
| . | 2367 | |
| [ | 7 | 0.3% |
| ] | 7 | 0.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2514743 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 409561 | |
| i | 308499 | |
| s | 303355 | |
| a | 218079 | |
| r | 218075 | |
| t | 207516 | |
| y | 207450 | |
| v | 181926 | |
| u | 101201 | 4.0% |
| b | 101194 | 4.0% |
| Other values (16) | 257887 |
Missing 
| Distinct | 61211 |
|---|---|
| Distinct (%) | 1.5% |
| Missing | 491289 |
| Missing (%) | 10.9% |
| Memory size | 34.5 MiB |
Length
| Max length | 255 |
|---|---|
| Median length | 63 |
| Mean length | 11.67467396 |
| Min length | 2 |
Unique
| Unique | 12830 ? |
|---|---|
| Unique (%) | 0.3% |
Sample
| 1st row | Kunth |
|---|---|
| 2nd row | (Seub.) Rohweder |
| 3rd row | Prance |
| 4th row | (Nees) Ekman |
| 5th row | (Britton ex Rusby) Wiehler |
| Value | Count | Frequency (%) |
| l | 655549 | 7.4% |
| 528030 | 6.0% | |
| ex | 293982 | 3.3% |
| a | 184606 | 2.1% |
| dc | 137922 | 1.6% |
| kunth | 108763 | 1.2% |
| gray | 104768 | 1.2% |
| benth | 100384 | 1.1% |
| sw | 88430 | 1.0% |
| hook | 85187 | 1.0% |
| Other values (10671) | 6515117 |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 5755821 | 12.3% |
| 4778366 | 10.2% | |
| e | 2841039 | 6.0% |
| r | 2154959 | 4.6% |
| a | 1898912 | 4.0% |
| l | 1877752 | 4.0% |
| n | 1788431 | 3.8% |
| ( | 1676769 | 3.6% |
| ) | 1676769 | 3.6% |
| o | 1598443 | 3.4% |
| Other values (105) | 20935970 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 23970208 | |
| Uppercase Letter | 8556492 | 18.2% |
| Other Punctuation | 6296715 | 13.4% |
| Space Separator | 4778366 | 10.2% |
| Open Punctuation | 1676769 | 3.6% |
| Close Punctuation | 1676769 | 3.6% |
| Dash Punctuation | 27912 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 2841039 | |
| r | 2154959 | 9.0% |
| a | 1898912 | 7.9% |
| l | 1877752 | 7.8% |
| n | 1788431 | 7.5% |
| o | 1598443 | 6.7% |
| t | 1484295 | 6.2% |
| i | 1367987 | 5.7% |
| h | 1198850 | 5.0% |
| u | 1056566 | 4.4% |
| Other values (54) | 6702974 |
Uppercase Letter
| Value | Count | Frequency (%) |
| L | 1021813 | |
| S | 792307 | 9.3% |
| B | 645199 | 7.5% |
| H | 609960 | 7.1% |
| M | 590105 | 6.9% |
| C | 571360 | 6.7% |
| A | 491164 | 5.7% |
| R | 469346 | 5.5% |
| G | 421747 | 4.9% |
| D | 402530 | 4.7% |
| Other values (29) | 2540961 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 5755821 | |
| & | 528658 | 8.4% |
| ' | 6283 | 0.1% |
| , | 4298 | 0.1% |
| \ | 1631 | < 0.1% |
| ? | 19 | < 0.1% |
| ; | 4 | < 0.1% |
| / | 1 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 4778366 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1676769 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1676769 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 27912 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 32526700 | |
| Common | 14456531 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 2841039 | 8.7% |
| r | 2154959 | 6.6% |
| a | 1898912 | 5.8% |
| l | 1877752 | 5.8% |
| n | 1788431 | 5.5% |
| o | 1598443 | 4.9% |
| t | 1484295 | 4.6% |
| i | 1367987 | 4.2% |
| h | 1198850 | 3.7% |
| u | 1056566 | 3.2% |
| Other values (93) | 15259466 |
Common
| Value | Count | Frequency (%) |
| . | 5755821 | |
| 4778366 | ||
| ( | 1676769 | 11.6% |
| ) | 1676769 | 11.6% |
| & | 528658 | 3.7% |
| - | 27912 | 0.2% |
| ' | 6283 | < 0.1% |
| , | 4298 | < 0.1% |
| \ | 1631 | < 0.1% |
| ? | 19 | < 0.1% |
| Other values (2) | 5 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 46805325 | |
| None | 177906 | 0.4% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 5755821 | 12.3% |
| 4778366 | 10.2% | |
| e | 2841039 | 6.1% |
| r | 2154959 | 4.6% |
| a | 1898912 | 4.1% |
| l | 1877752 | 4.0% |
| n | 1788431 | 3.8% |
| ( | 1676769 | 3.6% |
| ) | 1676769 | 3.6% |
| o | 1598443 | 3.4% |
| Other values (54) | 20758064 |
None
| Value | Count | Frequency (%) |
| ü | 62724 | |
| é | 39116 | |
| ö | 26088 | |
| ä | 7921 | 4.5% |
| á | 7902 | 4.4% |
| Á | 7616 | 4.3% |
| ø | 4718 | 2.7% |
| ó | 3620 | 2.0% |
| Ø | 2882 | 1.6% |
| è | 2708 | 1.5% |
| Other values (41) | 12611 | 7.1% |
vernacularName
Text
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 66.7% |
| Missing | 4515658 |
| Missing (%) | > 99.9% |
| Memory size | 34.5 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 7.666666667 |
| Min length | 7 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 33.3% |
Sample
| 1st row | Holotype |
|---|---|
| 2nd row | Isotype |
| 3rd row | Holotype |
| Value | Count | Frequency (%) |
| holotype | 2 | |
| isotype | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 5 | |
| t | 3 | |
| y | 3 | |
| p | 3 | |
| e | 3 | |
| H | 2 | 8.7% |
| l | 2 | 8.7% |
| I | 1 | 4.3% |
| s | 1 | 4.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 20 | |
| Uppercase Letter | 3 | 13.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 5 | |
| t | 3 | |
| y | 3 | |
| p | 3 | |
| e | 3 | |
| l | 2 | 10.0% |
| s | 1 | 5.0% |
Uppercase Letter
| Value | Count | Frequency (%) |
| H | 2 | |
| I | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 23 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 5 | |
| t | 3 | |
| y | 3 | |
| p | 3 | |
| e | 3 | |
| H | 2 | 8.7% |
| l | 2 | 8.7% |
| I | 1 | 4.3% |
| s | 1 | 4.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 23 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 5 | |
| t | 3 | |
| y | 3 | |
| p | 3 | |
| e | 3 | |
| H | 2 | 8.7% |
| l | 2 | 8.7% |
| I | 1 | 4.3% |
| s | 1 | 4.3% |
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 4515660 |
| Missing (%) | > 99.9% |
| Memory size | 34.5 MiB |
Length
| Max length | 17 |
|---|---|
| Median length | 17 |
| Mean length | 17 |
| Min length | 17 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Skog, Laurence E. |
|---|
| Value | Count | Frequency (%) |
| skog | 1 | |
| laurence | 1 | |
| e | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 11.8% | |
| e | 2 | 11.8% |
| S | 1 | 5.9% |
| k | 1 | 5.9% |
| o | 1 | 5.9% |
| g | 1 | 5.9% |
| , | 1 | 5.9% |
| L | 1 | 5.9% |
| a | 1 | 5.9% |
| u | 1 | 5.9% |
| Other values (5) | 5 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 10 | |
| Uppercase Letter | 3 | 17.6% |
| Space Separator | 2 | 11.8% |
| Other Punctuation | 2 | 11.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 2 | |
| k | 1 | |
| o | 1 | |
| g | 1 | |
| a | 1 | |
| u | 1 | |
| r | 1 | |
| n | 1 | |
| c | 1 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 1 | |
| L | 1 | |
| E | 1 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 1 | |
| . | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 13 | |
| Common | 4 | 23.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 2 | |
| S | 1 | |
| k | 1 | |
| o | 1 | |
| g | 1 | |
| L | 1 | |
| a | 1 | |
| u | 1 | |
| r | 1 | |
| n | 1 | |
| Other values (2) | 2 |
Common
| Value | Count | Frequency (%) |
| 2 | ||
| , | 1 | |
| . | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 17 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 11.8% | |
| e | 2 | 11.8% |
| S | 1 | 5.9% |
| k | 1 | 5.9% |
| o | 1 | 5.9% |
| g | 1 | 5.9% |
| , | 1 | 5.9% |
| L | 1 | 5.9% |
| a | 1 | 5.9% |
| u | 1 | 5.9% |
| Other values (5) | 5 |
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 4515659 |
| Missing (%) | > 99.9% |
| Memory size | 34.5 MiB |
Length
| Max length | 51 |
|---|---|
| Median length | 49.5 |
| Mean length | 49.5 |
| Min length | 48 |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Plantae, Monocotyledonae, Poales, Poaceae, Pooideae |
|---|---|
| 2nd row | Plantae, Pteridophyte, Polypodiales, Athyriaceae |
| Value | Count | Frequency (%) |
| plantae | 2 | |
| monocotyledonae | 1 | |
| poales | 1 | |
| poaceae | 1 | |
| pooideae | 1 | |
| pteridophyte | 1 | |
| polypodiales | 1 | |
| athyriaceae | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 14 | |
| a | 12 | |
| o | 11 | |
| 7 | 7.1% | |
| , | 7 | 7.1% |
| P | 7 | 7.1% |
| t | 6 | 6.1% |
| l | 6 | 6.1% |
| n | 4 | 4.0% |
| y | 4 | 4.0% |
| Other values (9) | 21 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 76 | |
| Uppercase Letter | 9 | 9.1% |
| Space Separator | 7 | 7.1% |
| Other Punctuation | 7 | 7.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 14 | |
| a | 12 | |
| o | 11 | |
| t | 6 | |
| l | 6 | |
| n | 4 | 5.3% |
| y | 4 | 5.3% |
| d | 4 | 5.3% |
| i | 4 | 5.3% |
| c | 3 | 3.9% |
| Other values (4) | 8 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 7 | |
| M | 1 | 11.1% |
| A | 1 | 11.1% |
Space Separator
| Value | Count | Frequency (%) |
| 7 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 7 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 85 | |
| Common | 14 | 14.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 14 | |
| a | 12 | |
| o | 11 | |
| P | 7 | |
| t | 6 | |
| l | 6 | |
| n | 4 | 4.7% |
| y | 4 | 4.7% |
| d | 4 | 4.7% |
| i | 4 | 4.7% |
| Other values (7) | 13 |
Common
| Value | Count | Frequency (%) |
| 7 | ||
| , | 7 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 99 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 14 | |
| a | 12 | |
| o | 11 | |
| 7 | 7.1% | |
| , | 7 | 7.1% |
| P | 7 | 7.1% |
| t | 6 | 6.1% |
| l | 6 | 6.1% |
| n | 4 | 4.0% |
| y | 4 | 4.0% |
| Other values (9) | 21 |
taxonRemarks
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 50.0% |
| Missing | 4515659 |
| Missing (%) | > 99.9% |
| Memory size | 34.5 MiB |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 7 |
| Min length | 7 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Plantae |
|---|---|
| 2nd row | Plantae |
| Value | Count | Frequency (%) |
| plantae | 2 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 4 | |
| P | 2 | |
| l | 2 | |
| n | 2 | |
| t | 2 | |
| e | 2 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 12 | |
| Uppercase Letter | 2 | 14.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 4 | |
| l | 2 | |
| n | 2 | |
| t | 2 | |
| e | 2 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 14 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 4 | |
| P | 2 | |
| l | 2 | |
| n | 2 | |
| t | 2 | |
| e | 2 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 14 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 4 | |
| P | 2 | |
| l | 2 | |
| n | 2 | |
| t | 2 | |
| e | 2 |